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^ (57) Abstract: The present invention relates to genes cloned from the plant pathogens Cochliobolus heterosirophus, Aliernaria 
® solani, Fusarium graminearium, and Pyrenophora teres , that encode a CPS1 peptide synthetase required for fungal pathogenesis. 
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^ encoded by the CPS1 genes. Other aspects of the invention relate to a method of imparting disease resistance to a plant by overex- 
pressing a protein of the present invention in a plant and a method for identifying inhibitors of a CPS1 protein in a sample. 
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PEPTIDE SYNTHETASE GENE CPS1 

CROSS REFERENCE TO RELATED APPLICATIONS 

This application is a continuation in part application of U.S. 
Application Serial No. 09/448,215, filed November 23, 1999. 

5 STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH 

The present invention was made with support from the United 
States Government under Grant No. 96-35303-3198 from the USDA/NRI. The 
United States Government may have certain rights in the invention. 

FIELD OF THE INVENTION 

1 o The present invention relates to nucleic acid molecules encoding 

peptide synthetase homologs of Cochliobolus heterosirophus, Pyrenophora teres, 
Fusarium graminearium, and Alternaria solani and uses thereof. 

BACKGROUND OF THE INVENTION 

There are approximately 30 species included in the genus 

1 5 Cochliobolus, nearly all of which are pathogens of wild grasses or cereals (Yoder 
et al, "Cochliobolus spp. And Their Host-Specific Toxins, in Carroll, eds., The 
Mvcota Vol. 5: Plant Relationships, Part A , BerlimSpringer-Verlag, pp. 145-166 
(1997)). Cochliobolus heterosirophus represents the most widely distributed 
species in the genus and can be found in many tropical and subtropical areas in the 

20 world. As a natural pathogen of corn, C heterosirophus causes a disease 

frequently called leaf spot of maize in the old literature (Drechsler, "Leafspot of 
Maize Caused by Ophiobolus Heterosirophus n. sp., The Ascigerous Stage of a 
Helminthosporium Exhibiting Bipolar Germination," J. Agr. Res., 3 1 :70 1-726 
(1925); Drechsler, "Phytopathological and Taxonomic Aspects of Ophiobolus, 

25 Pyrenophora, Helminthosporium, and a New Genus, Cochliobolus" Phytopathol., 
24:953-983 (1934); Yu, "Studies on Helminthosporium Leaf Spot of Maize," 
3:273-318 (1933); Orillo, "Leafspot of Maize Caused by Helminthosporium 
maydis," 36:327-395 (1952). In the United States, C heterosirophus is usually 
found in the warmer southern states, thus, the disease is commonly known as 

30 Southern Corn Leaf Blight (Hooker, "Cytoplasmic Susceptibility in Plant 
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Disease," Ann. Rev. Phvtopathol , 12:167-179 (1974)). For many years, Southern 
Com Leaf Blight was only known as an endemic disease and was not considered 
to be major economic importance in the United States. But in 1970, it suddenly 
broke into a severe epidemic that destroyed 15% of the U.S. corn crop and caused 
5 losses estimated at more than $ 1 billion. This serious damage made Southern 
Corn Leaf Blight one of the most widely known crop diseases in the U.S. 

Prior to the outbreak of the disease, only one race of C. 
heterostrophus (race O) was known in the field. In late 1969 when the disease 
became an epidemic, a new race of the fungus was identified from infected corn 

1 0 leaves collected in severely diseased areas. It was soon designated as race T 
because of its high virulence on T-cytoplasm corn and the ability to produce a 
phytotoxin called T-toxin, which specifically affects T-corn. In contrast, race O 
does not produce T-toxin and is mildly virulent on both T-cytoplasm and N- 
cytoplasm (normal cytoplasm) corn (Hooker et al., "Physiological Races of 

1 5 Helminthosporium maydis and Disease Resistance," Plant Pis. Reptr., 54:11 09- 
1110 (1970); Scheifele, "Cytoplasmically Inherited Susceptibility to Diseases as 
Related to Cytoplasmically Controlled Pollen Sterility in Maize," 25:1 10-138 
(1970); Smith et al., "Physiologic Races of Helminthosporium maydis" 54:819- 
822 (1970); Yoder et al., "Segregation of Pathogenicity Types and Host-Specific 

20 Toxin Production in Progenies of Crosses Between Races T and O of 

Helminthosporium maydis (Cochliobolus heterostrophus); 1 Phytopathology, 
65:273-276 (1975); Yoder, "Evaluation of the Role of Helminthosporium maydis, 
Race T Toxin in Southern Cora Leaf Blight, in Tomiyama, eds., Biochemistry and 
Cvtoloev of Plant Parasite Interaction. New York, New York:Elsevier, pp. 1 6-24 

25 (1 976); Yoder, "Toxins in Pathogenesis," Ann Rev. PhvtopathoL 1 8: 1 03- 1 29 
(1980)). T-cytoplasm stands for Texas male sterile cytoplasm, a unique 
cytoplasm with a trait for maternally inherited male sterility, characterized by the 
failure to produce pollen (Levings, "The Texas Cytoplasm of Maize: Cytoplasmic 
Male Sterility and Disease Susceptibility," Science . 250:942-947 (1990)). T- 

30 cytoplasm corn was widely used for hybrid seed production and breeding to avoid 
hand or mechanical emasculation in the 1950s and the 1960s. It was the 
coexistence of large acreages of intensively planted T-cytoplasm corn and the 
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sudden appearance of race T of C. heterostrophus that resulted in the epidemic of 
the disease in 1970. This discovery first opened the door to understanding 
pathogenesis by C. heterostrophus. 

Early genetic analysis suggested that both T-toxin production and 

5 high virulence on T-cytoplasm com are controlled by a single genetic locus 

defined as Toxl (Leach et al., "Dominance at the Toxl Locus Controlling T-Toxin 
Production by Cochliobolus heterostrophus" Physiol. Plant Pathol., 21:327-333 
(1982)). This was demonstrated by crosses between race T and race O in which 
only parental phenotypes segregated in a 1:1 ratio (Tox + :Tox"); all T-toxin 

1 0 producing progeny are highly virulent on T-cytoplasm corn while all T-toxin 
nonproducing progeny are weakly virulent (Yoder et al., "Segregation of 
Pathogenicity Types and Host-Specific Toxin Production in Progenies of Crosses 
Between Races T and O of Helminthosporium maydis {Cochliobolus 
heterostrophus), Phytopathology . 65:273-275 (1975); Leach et al., "Dominance at 

1 5 the Toxl Locus Controlling T-Toxin Production by Cochliobolus heterostrophus? 
Ph ysiol. Plant Pathol.. 21 :327-333 (1982)). Further investigation by comparison 
of electrophoretic karyotypes and chromosome-specific DNA hybridizations 
indicated that Toxl is tightly linked to a reciprocal translocation breakpoint and is 
associated with as much as a megabase of DNA (mostly highly repeated and A+T- 

20 rich) that is missing in race O (Bronson, "Ascospore Abortion in Crosses of 
Cochliobolus heterostrophus Heterozygous for the Virulence Locus Toxl ," 
Genome , 30:12-18 (1988); Tzeng et al., "A Restriction Fragment Length 
Polymorphism Map and Electrophoretic Karyotype of the Fungal Maize Pathogen 
Cochliobolus heterostrophus," Genetics . 130(l):81-96 (1992); Chang et al., "A 

25 Reciprocal Translocation and Possible Insertions) Tightly Associated with Host- 
Specific Virulence in Cochliobolus heterostrophus" Genome, 39(3):549-557 
(1996)). Surprisingly, recent analyses of several Toxr mutants revealed that Toxl 
is not a single locus but rather two loci, each on a different translocated 
chromosome (Yoder et al., "Molecular Determinants of the Plant/Fungus 

30 Interaction," in Kohmoto, eds., Host-Specific T oxin: Biosynthesis, Receptor and 
Molecular Biology. Tottori, JapamFaculty of Agriculture, Tottori Univ., pp. 23-32 
(1994); Turgeon et al., "Function and Chromosomal Location of the Cochliobolus 
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heterostrophus Toxl Locus " Can. J. Bot., 73 (suppl. 1 Sect. E-H):S1071-S1076 
(1995)). These two Toxl loci have been designated Toxl A and ToxlB (Yoder et 
al, "Cochliobolus spp. and Their Host-Specific Toxins," in Carroll, eds., Jhe 
Mvcota Vol. 5: Plant Relationships. Part A, BerlinrSpringer-Verlag, pp. 145-166 
5 (1997)). Two genes PKS1 and DEC! have been cloned from the two loci 

respectively; both are required for biosynthesis of T-toxin and are found only in 
race T isolates of C. heterostrophus (Yang, "The Molecular Genetics of T-Toxin 
Biosynthesis by Cochliobolus heterostrophus;' Ph.D. Thesis, Cornell University 
(1995); Yang et al., "A Polyketide Synthase is Required for Fungal Virulence and 
10 Production of the Polyketide T-Toxin," Plant Cell , 8(1 1):2139-2150 (1996); Rose 
et al, "A Decarboxylase Required for Poloyketide Toxin Production and High 
Virulence by Cochliobolus heterostrophus? 8 th Int. Svmp . Mol. Plant-Microbe 
Int> Knoxville, p. J-49 (1 996)). 

Genetic analysis also suggested that T-toxin is required by C. 

1 5 heterostrophus for its high virulence on T-cytoplasm corn. This hypothesis was 
first tested by the generation of induced T-toxin deficient mutants using different 
mutagenesis procedures. All mutants with a tight Tox" phenotype cause disease 
symptoms that are indistinguishable from those caused by race O when tested on 
both T and N-cytoplasm corn, suggesting that T-toxin is indeed a virulence factor 

20 (Yang et al., 1992; Lu et al., "Tagged Mutations at the Toxl Locus of 

Cochliobolus heterostrophus Using Restriction Enzyme-Mediated Integration," 
Proc. Natl. Acad. Sci. USA , 91 : 12649-12653 (1994); Rose et al., "A 
Decarboxylase Required for Polyketide Toxin Production and High Virulence by 
Cochliobolus heterostrophus," 8 th Int. Svmp. Mol. Plant-Microbe Int. , Knoxville, 

25 p. J-49 (1996)). This conclusion was firmly supported by the site-specific 

disruption of the PKS1 or DEC1 in the wild type race T genome; disruptants lost 
the ability to produce T-toxin and caused race O type symptoms on both T-corn 
and N-corn (Yang et al., "A Polyketide Synthase is Required for Fungal Virulence 
and Production of the Polyketide T-Toxin," Plant Cell 8(1 1):2139-2150 (1996); 

30 Rose et al., "A Decarboxylase Required for Polyketide Toxin Production and 
High Virulence by Cochliobolus heterostrophus? 8 th Int. Svm p. Mol. Plant- 
Microbe Int. , Knoxville, p. J-49 (1996)). These experiments have given a very 
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clear resolution for the role of T-toxin in pathogenesis. They also implied that 
pathogenesis by C. heterostrophus must involve additional pathogenicity factors 
because race O which does not produce T-toxin and race T-derived Toxr mutants 
are effective pathogens on corn. 

5 A number of fungal molecules have been identified as general 

pathogenicity or virulence factors in several plant pathogenic fungi (Yoder et al., 
"Molecular-Genetic Evaluation of Fungal Molecules for Roles in Pathogenesis in 
Plants ," J. Genet. , 75(3):425-440 (1996)). These include potential penetration 
factors such as melanin (Guillen et al., "Linkage Among Melanin Biosynthetic 
1 0 Mutations in Cochliobolus heterostrophus" Fungal Genet. Newsl., 41:41-42 
(1994)), cutinase (Oeser et al., "Pathogenesis by Cochliobolus heterostrophus 
Transformants Expressing a Gene Encoding Cutinase from Nectria 
haematococca," Mnl Plant-Microbe Int. . 7:282-288 (1994)) and 
polygalacturonase and xylanase (Lyngholm et al., "Mutants of Cochliobolus 
1 5 heterostrophus Deficient in Extracellular Enzymes," Funpal Genet. Newsl., 42:46- 
47 (1995)) or possible mechanisms involved in colonization such as phytotoxin 
detoxification (Schafer et al., "One Enzyme Makes a Fungal Pathogen, But Not a 
Saprophyte, Virulent on a New Host Plant," Science. 246:247-249 (1989)) or 
components of signal transduction pathways (Horwitz et al., "A G Protein Alpha 
20 Subunit Gene From the Corn Pathogen Cochliobolus heterostrophus is Involved 
in Two Complex Developmental Pathways: Mating and Appressorium Formation 
(unpublished) (1997)). Although C. heterostrophus is known to produce a 
nonhost specific toxin called ophiobolin (or cochliobolin), a C 25 sesterterpenoid 
compound, which is toxic to many organisms, including plants, bacteria, fungi and 
25 nematodes, there is no evidence that ophiobolins are involved in pathogenesis by 
C. heterostrophus or other phytopathogenic fungi. No other pathogenesis-related 
toxins have been isolated from C. heterostrophus so far, but studies on closely 
related Cochliobolus species and other phytopathogenic fungi suggest that 
pathogenesis by this group of fungi also involves peptide toxins. 

30 Four peptide phytotoxins (victorin, HC-toxin, AM-toxin, and 

enniatins) have been characterized as pathogenicity or virulence factors. They are 
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all small cyclic peptides (4-6 residues), containing unusual amino acids or 
hydroxy acids, and they can be either host specific or non-host specific in terms of 
plant toxicity. A number of peptide phytotoxins are believed to be synthesized 
nonribosomally. Early in the 1960s, several biochemists working on the bacterial 
5 peptide antibiotics gramicidin and tyrocidine found that these polypeptides can be 
synthesized in RNAase-treated particle-free extracts of Bacillus brevis that are 
known to produce the same antibiotics; adding protein-synthesis inhibitors to the 
extracts does not affect this process. This indicated the existence of a peptide 
biosynthetic system in which ribosomes and mRNAs are not needed. Further 

1 0 studies revealed that in this system, peptides are synthesized on a protein-template 
and this template itself is a multifunctional enzyme or a complex of several such 
enzymes, collectively called peptide synthetases, catalyzing the biosynthetic 
process (Laland et al., "The Protein Thiotemplate Mechanism of Synthesis for the 
Peptide Antibiotics Produced by Bacillus Brevis" Essays in Biochemistry . 7:31- 

1 5 57 (1 973); Lipmann, "Bacterial Production of Antibiotic Polypeptides by Thiol- 
Linked Synthesis on Protein Templates," Adv. Microbiol. Physiol., 21:277-266 
(1980)). 

Peptide synthetases can catalyze biosynthesis of a variety of 
peptides. In terms of bioactivity, they can be antibiotics, enzyme inhibitors, plant 

20 or animal toxins and immunosuppressants (Stachelhaus et al., "Modular Structure 
of Peptide Synthetases Revealed by Dissection of the Multifunctional Enzyme 
GrsA.," Journal of Biological Chemistry. 270(1 1):6163-6169 (1995)). In terms of 
chemical structure, they can be either linear (i.e. ACV, the penicillin precursor and 
gramicidin) or cyclic (most are). The latter can be further classified into three 

25 subgroups: 1) The "standard" cyclic peptides (i.e. gramicidin S, tyrocidine, HC- 
toxin and cyclosporin); 2) cyclic lactones (i.e. destruxin); 3) cyclic depsipeptides 
(i.e. beauvericin and enniatin). There have been over 300 different carboxy 
compounds that can be activated by peptide synthetases. 

Although the first peptide synthetase, Gramicidin S synthetase, was 
30 purified and used for the cell-free synthesis of the peptide early in the 1960s 
(Tomino et al., "Cell-Free Synthesis of Gramicidin S," Biochem. , 6:2552-2560 
(1967)), the first bacterial peptide synthetase gene, tycA, which encodes the 



SUBSTITUTE SHEET (RULE 26) 



WO 01/38489 PCT/US00/32227 



tyrocidine synthetase 1 in B. brevis was not cloned until almost twenty years later 
(Marahiel et al., "Cloning of the Tyrocidine Synthetase 1 Gene from Bacillus- 
brevis and Its Expression in Escherichia-coli, Mol. Gen. Genet., 201(2): 1986 
(1985)). Since then, more than twenty peptide synthetase genes have been 
5 reported for both bacteria and filamentous fungi, but only fourteen have complete 
nucleotide sequences published. All are larger than 3.3 kb and range between 3.3- 
19.5 kb for bacterial genes and 9.4-45.8 kb for fungal ones. Interestingly, all 
fungal peptide synthetase genes reported lack introns, even the cyclosporin A 
synthetase gene simA, which has a 45.8 kb of open reading frame (the largest 

1 0 genomic ORF so far recorded). Although biosynthesis of bacterial peptides 
differs from that of fungal ones in terms of the number of multifunctional 
enzymes involved, the genes encoding these enzymes are similar to each other in 
both function and structure. Comparison of nucleotide sequences reveals one or 
more highly conserved regions at certain positions in each peptide synthetase 

1 5 gene. These regions formerly called "amino acid activating domains" 

(Stachelhaus et al., "Modular Structure of Peptide Synthetases Revealed by 
Dissection of the Multifunctional Enzyme GrsA," Journal of B iological 
Chemistry , 270(1 1):6163-6169 (1995)), now called "amino acid activating 
modules" (Marahiel, "Protein Templates for the Biosynthesis of Peptide 

20 Antibiotics," Chem. Biol. , 4(8):561-567 (1997)) consist of a set of domains 
(formerly called "modules") believed to have specific functions such as 
recognization, activation and thioesterification of individual constituent amino or 
hydroxy acids, and in some cases methylation and racemation for modification of 
certain residues before incorporation into the peptide chain (Stachelhaus et al., 

25 "Modular Structure of Peptide Synthetases Revealed by Dissection of the 

Multifunctional Enzyme GrsA," Journal of Biological Chemistry , 270(1 1):6163- 
6169 (1995)). The most convincing evidence supporting this assignment is that in 
most cases, the number of conserved functional units in each gene or gene cluster 
is equal to the number of amino acids in the respective peptide. This one-for-one 

30 match is very clear between three of four fungal peptides and their biosynthetic 
genes. The total number of modules in three of four bacterial gene clusters also 
matches the number of amino acids in the respective peptides. 
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Sequence alignment of amino acid-activating modules reveals 
strictly conserved sequence motifs that contain active residues for module 
functions. These motifs are called "core sequences" (Marahiel, "Multidomain 
Enzymes Involved in Peptide Synthesis," FEBS Lett. , 307(l):40-43 (1992)). A 

5 minimal amino acid-activating module must contain six core sequences, whose 
functions (except for core 1) have been proposed based on mutational analysis of 
several peptide synthetases. Core sequences 1-5 are grouped into an amino acid 
adenylation domain and core 6 is a thioester formation domain (Figure 1 A). All 
bacterial peptide synthetase genes contain "type I modules" - the minimal amino 

1 0 acid activating modules which were previously called "type I domains" 

(Stachelhaus et al., "Modular Structure of Peptide Synthetases Revealed by 
Dissection of the Multifunctional Eiuyme GrsA," Journal of Biological 
Chemistry , 270(1 1):6163-6169 (1995)). Two fungal genes, acvA and HTS1 also 
have this modular structure. In addition to the type I module, two fungal genes, 

1 5 esynl and simA, contain type II modules, in which an insertion (about 400 amino 
acids) is found between cores 5 and 6 of a normal type I module. This region 
contains a motif (VLE/DXGXGXG (SEQ. ID. No. 1)), highly conserved in S- 
adenosyl-methionine (SAM)-dependent methyltransferases, hence, it is referred to 
as a W-methylation domain (Figure 1A). Additional evidence for 

20 methyltransferase activity of this module is that the number and position of type II 
modules in esynl and simA exactly match that of TV-methylated amino acids in 
ennatin and cyclosporin sequences (Figure IB). 

Although the modular structure described above is highly 
conserved among most peptide synthetase genes, some variations have been found 

25 in the latest cloned peptide synthetase gene sq/B, which is the first gene in the 
saframycin Mxl synthetase gene cluster (Pospiech et al., "A New Myxococcus 
xanthus Gene Cluster for the Biosynthesis of the Antibiotic Saframycin Mxl 
Encoding a Peptide Synthetase," Microbiology, 141(8):1793-1803 (1995)). sq/B 
contains two type I amino acid activating modules. One module has all six highly 

30 conserved core sequences, but another, believed to activate alanine (the first 

amino acid in the linear tetrapeptide precursor of saframycin Mxl), lacks core 5 
and has a weakly conserved core 1 (Pospiech et al., "Two Multifunctional Peptide 
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Synthetases and an O-methyltransferase Are Involved in the Biosynthesis of the 
DNA-Binding Antibiotic and Antitumour Agent Saframycin Mxl from 
Myxococcus xanthus," Microbiology, 142(4): 74 1-746 (1996)) (Figure 1 A). This 
suggests that some of the motifs in the amino acid adenylation domain are 
5 dispensable or not critical for domain function. It also raises the possibility that 
other variations might be found in yet unknown peptide synthetase genes. 

Although C. heterostrophus has been a model eukaryotic plant 
pathogen since the 1 970s, most molecular genetic analyses conducted in this . 
system have focused on production of the polyketide T-toxin by race T isolates of 

10 the fungus. Solid evidence now indicates that T-toxin is a host-specific virulence 
factor in Southern Corn Leaf Blight (Yoder et al., "Molecular-Genetic Evaluation 
of Fungal Molecules for Roles in Pathogenesis in Plants," J. Genet. , 75(3):425- 
440 (1996); Yoder et al., Cochliobolus spp. and Their Host-Specific Toxins, 
Carroll eds., Th* Mvcota Vol S- Plant Relationships, Part A, Berlin.Springer- 

15 Verlag, pp. 145-166 (1997)). It is clear, however, that C. heterostrophus needs 
additional factors, presumably general factors for pathogenesis to corn plants, 
since race O, which does not produce T-toxin, can be an effective corn pathogen. 
Attempts to identify additional general factors required by C. heterostrophus for 
pathogenesis have been unsuccessful. Cloning and characterizing additional C. 

20 heterostrophus genes that control biosynthesis of novel fungal molecules involved 
in critical pathogenic processes may be important because it offers potential 
targets for the design of products that might interfere with the corn plant infection 
process. The present invention is directed to achieving these objectives. 

SUMMARY OF THE INVENTION 

25 The present invention relates to an isolated nucleic acid molecule 

encoding CPS 1 peptide synthetase homologs. The DNA molecule comprises a 
nucleotide sequence which hybridizes to a DNA molecule having a sequence as 
set forth in at least one of SEQ. ID. No. 2, SEQ ID No. 41 , SEQ ID No. 43, or 
SEQIDNo.45. 

30 Another aspect of the present invention relates to a method for 

identifying inhibitors of a CPS1 protein or polypeptide which involves providing 



SUBSTITUTE SHEET (RULE 26) 



WO 01/38489 



PCT/US00/32227 



- 10- 

the CPS1 protein or polypeptide, contacting the protein with potential inhibitor 
compounds, determining peptide synthetase activity, and selecting compounds 
which decrease the peptide synthetase activity. 

Still another aspect of the present invention relates to a method of 
5 imparting disease resistance to a plant by overexpressing a CPS1 protein or 
polypeptide in the plant cell. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 provides the structure of amino-acid activating modules 
identified in peptide synthetase genes (adapted from Stachelhaus and Marahiel, 

10 1995; Pospiech 1995; Marahiel, 1 997). Figure 1 A shows the domain 

arrangements in two types of modules. Structural variations in the first module 
(safBl) of the gene safB are also indicated below type I. Figure IB shows the 
correlation between module types and the nature of residues in two fungal 
peptides. Open box: type I module; filled box: type II module. Each peptide 

1 5 sequence is given below. 

Figure 2 depicts the recovery of DNA flanking the REMI vector 
insertion site (arrows) in mutant R.C4.2696. Circled numbers indicate restriction 
enzyme sites used for recovery of each plasmid. p214B7 contains 4.2 kb flanking 
DNA (3.4 left; 0.7 right); p214Ml contains 0.1 kb left flank that overlaps with 
20 p2 1 4B7 and 1 . 1 kb right flank that overlaps with p2 1 4S 1 , which contains 3 .2 kb 
flanking DNA on the left only. 

Figures 3A-B illustrate the extension of the tagged sequence by 
targeted integration and plasmid rescue. Figure 3A shows a general scheme 
illustrating chromosome walking strategy (only one direction is indicated). "X" 

25 indicates a restriction enzyme site used for recovery of vector (indicated by "T") 
with flanking genomic DNA ("L" for left flank; "R" for right flank; R* for a 
fragment from M R" that is subcloned into a subsequent transformation vector). 
Each integration site is indicated by a vertical arrowhead pointed at the selectable 
marker on the vector. Overlapped flanking DNA is represented by bars drawn in 

30 the same pattern. Figure 3B shows two strategies used for construction of a 
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targeted integration vector. Left: A genomic DNA fragment (indicated by the 
letter "a") is subcloned into a vector directly and the subsequent integration occurs 
by a single crossover ( "a" is duplicated). Right: "a" is subcloned into a vector via 
multiple steps (see Figure 6 for an example) that allows linearization with 
5 restriction enzyme "Y" and the subsequent integration occurs by a double 
crossover resulting in a simple insertion. 

Figure 4 shows how the targeted integration vector p214SNP was 
constructed. The sites for restriction enzyme digestion are indicated by arrows. 
p214SNP was originally designed for a double crossover integration after digested 
1 0 with BstXl and Bell, but it was found later that the E. coli strain (DH5a) used for 
amplification of the plasmid is Dam + which blocks the Bell site (TGA TCA to 
TG m TCA). As an alternative, the plasmid was linearized with BstXl only and 
used for transformation. 

Figure 5 depicts the recovery of genomic DNA flanking the 
1 5 targeted integration vector p2 14SNP in transformant #118. The vector integrated 
into the target site by a single crossover that resulted in duplicated M«dIII-5acI 
fragments (indicated by letter b and b'). The second genomic DNA fragment 
carried by the vector is also duplicated in the genome (Narl-Hindlll, indicated by 
letter a and a'). Genomic DNA was digested with flg/11 (which does not cut the 
20 vector) or Bell (which cuts once in fragment a' on the vector) as indicated by 

numbers in a circle. The two recovered plasmids carry sequences that extend the 
right flank only by 170 and 800 bp respectively beyond the genomic DNA on 
P 214S1. 

Figure 6 shows how the targeted integration vector pi 1 8BSP was 
25 constructed. The sites for restriction enzyme digestion on the vectors are 
indicated by arrows, pi 18B14 contains two duplicated NarVHindmSacl 
fragments (numbered region, a + b = a' + b') but only one is present in pi 1 8BSP. 
pi 18BSP was linearized with Bghl and transformed into a wild type C. 
heterostrophus strain. 
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Figure 7 depicts the recovery of genomic DNA flanking the 
targeted integration vector pi 18BSP in transformant #9. The vector integrated 
into the target site by a double crossover. Genomic DNA was digested with Pstl 
and relegated. The recovered plasmid p9P2 includes the entire pUC18 sequence 
5 on pi 18BSP and 4.6 kb of genomic DNA that contains all of ORF1 (CPS1), 
including the stop codon (TAG) and 3.0 kb of genomic region 3' of the stop 
codon. 

Figure 8 shows how the targeted integration vector pi 18BCS was 
constructed. The sites for restriction enzyme digestion are indicated by arrows. 

10 Figure 9 depicts the recovery of genomic DNA flanking the 

targeted integration vector pi 18BSP in transformant #12. The vector integrated 
into the target site by a single crossover that resulted in a duplicated Sspl-Sspl 
fragment in the transformant genome (indicated by the letter "a"). Genomic DNA 
was digested with //mdlll (arrows) and ligated. The recovered plasmid pl2H6 

1 5 contains the entire pi 18BSP sequence and a 2.1 kb genomic DNA (Sacl-tfindlll) 
on the left region that overlaps with the sequence carried on p9P2 (see Figure 7). 

Figures 10A-B are photographs which show that the REMI mutant 
R.C4.2696 grows like wild type in culture (Figure 10A) and produces normal 
appressoria (Figure 10B). In Figure 10A, plates containing complete medium 

20 (CM) were inoculated with a conidia-bearing mycelium plug and incubated at 
22°C under warm white light (F40/350BL) (Sylvania Inc., Danvers, MA). The 
photograph in Figure 10A was taken 6 days after inoculation. Left to right: 
Mutant R.C4.2696; wild type. In Figure 10B, conidia of mutant (left) or wild type 
(right) were placed in a drop of water and incubated at 32°C for 6 hrs. No 

25 significant difference in percentage of appressorium-forming conidia (arrows) was 
detected. 

Figures 1 1 A-B are photographs which show that the REMI mutant 
R.C4.2696 produces wild type levels of T-toxin (Figure 1 IB) but has reduced 
virulence on T-cytoplasm corn (Figure 1 1 A). In Figure 1 1 A, two week old T- 
30 cytoplasm com plants were inoculated with conidial suspensions of (left to right) 
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mutant R.C42696, wild type race T, and a Tox+ mutant progeny from a cross 
between R.C4.2696 and a race 0 tester, and symptoms (represented by three 
diseased leaves for each strain) were recorded 5 days after inoculation. Leaves 
infected by wild type race T collapsed (arrow) but those infected by mutants 
5 remained alive. In Figure 1 IB, a plate containing T-toxin-sensitive E. coli cells 
was inoculated with agar blocks bearing mycelia of three strains (in the same 
order as in Figure 1 1 A) and incubated at 32°C overnight (the inoculum at bottom 
is race O control). No significant differences in T-toxin production (indicated by 
halos) among the three strains were detected. 

1 0 Figure 1 2 is a photograph which displays that the REMI mutant 

R.C4.2696 produces lesions much smaller than wild type on N-cytoplasm corn. 
Two week old N-cytoplasm corn plants were inoculated with conidial suspensions 
at the same concentration and symptoms were recorded 7 days after inoculation. 
The mutant (right) produces the same number of lesions as wild type (left) but the 

1 5 size of lesions was dramatically reduced compared to wild type (arrows). 

Figures 13A-B show that the REMI mutant R.C4.2696 has a 60% 
reduction of virulence compared to wild type. In Figure 13 A, lengths of 100 
typical lesions from corn leaves inoculated with wild type race O and a mutant 
progeny R45 {Tox, hygB*) carrying the R.C4.2696 mutation were measured 7 
20 days after inoculation and values plotted. Figure 1 3B shows the statistical 

analysis that 86% of the mutant lesions are less than 4 mm in length (average size 
3.5 mm), 60% reduced compared to that of wild type (8.5 mm). 

Figures 14A-B illustrate the genetic analysis showing that a tagged, 
single site mutation is responsible for the mutant phenotype. Figure 14 A is a 

25 photograph of a plant assay where N-cytoplasm corn was inoculated with parents 
and progeny indicated in Figure 14B (a complete tetrad from the second cross is 
shown here; random spore assay for the two crosses gave the same results). 
Figure 14B is a table which shows progeny segregation data. In both crosses, 
progeny segregated 1 : 1 for parental type only and all hygromycin B resistant 

30 progeny showed the same small lesion phenotype (represented by progeny 1-4 in 
Figure 14A) as the mutant parent (parent 1 in Figure 14A); all hygromycin B 
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sensitive progeny show the large lesion phenotype (represented by progeny 5-6 in 
Figure 14 A) of the wild type parent (parent 2 in Figure 14 A). 

Figure 15 is a restriction map of the cloned sequences surrounding 
the tagged site. A 1 1.3 kb genomic region (thick line) was cloned and completely 

5 sequenced. The original REMI insertion point in the mutant R.C4.2696 is 

indicated by a vertical arrow. The asterisks indicate two targeted integration sites 
in the wild type genome. Two open reading frames (in opposite directions), 
ORF1 (CPS1, 5.4 kb) and ORF2 (TES1, 1.1 kb) are indicated by open boxes 
below the map (the positions of putative introns are indicated by vertical bars). 

1 0 Locations of seven overlapping plasmid clones used for sequencing are indicated 
by thin lines on the top of the map (filled triangles represent the vector sequence 
in each clone). Sequencing strategy is indicated by arrows above each clone line. 

Figure 16 graphically depicts the G+C content of the cloned 
sequences surrounding the tagged site. The 1 1.3 kb of genomic DNA sequence 

1 5 (Figure 1 5) was examined as 200 bp fragments and G+C content of each fragment 
was calculated and plotted. Regions A and C have normal G+C content; regions 
B and D show high G+C content. The overall percentage of G+C in each region 
is indicated by underlined numbers. The positions of ORF1 {CP SI) and ORF2 
(TES1) are indicated by arrows and the percentage of G+C of each ORF is given 

20 below (in parentheses). 

Figure 17 displays the nucleotide sequence of CPSL 5,725 base 
pairs from the 1 1.3 kb sequenced region (Figure 15) are shown. The deduced 
amino acid sequence of CPS 1 protein is given below the DNA sequence. The 
position of start codon ATG (bold and underlined) is designated +1 and the open 

25 reading frame stops at position 5381 (TAG, in bold and underlined). Five putative 
M CAAT" boxes (bold and indicated by asterisks) are found at the positions -36, - 
58, -67, -172 and -309. Three putative introns (in lowercase with 5' and 3* splice 
sequences in bold; branch sites are underlined) are located at positions 2070-21 14, 
3542-3592 and 4197-4249. Conserved core sequences are shaded. TheGXSXG 

30 motif is boxed. A putative polyadenylation signal is located at position 5604 
(bold and overlined). 
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Figure 18 displays the nucleotide sequence of TES1. 1 ,901 base 
pairs 5' of CPS1 are shown. The deduced amino acid sequence of the TES1 
protein is given below the DNA sequence. The CT motif (bold and indicated by 
asterisks) is found at position -46. A "AT'-rich region (bold and italicized) is 

5 found at position -12. The position of ATG start codon (bold and underlined) is 
designated +1 and the open reading frame (transcribed in the opposite direction 
from CPS1) stops at position 1 153 (TGA, in bold and underlined). One putative 
intron (in lowercase, border sequences are bold; branch sites are underlined) is 
located at position 518-566. A putative polyadenylation signal is located at 

10 position 1345 (bold and overlined). The putative active site (VHS) is shaded. 

Figures 19A-C are schematic representations which show the 
characterization of modular structure of CPSL Peptide synthetase and 
thioesterase are indicated by open boxes; shaded boxes inside indicate functional 
domains and modules; vertical bars in the shaded boxes indicate highly conserved 

1 5 core sequences. Figure 19A illustrates the general structure of bacterial and 
fungal peptide synthetases (adapted from Marahiel 1997, which is herein 
incorporated by reference). A peptide synthetase gene cluster is shown on the top. 
There can be one or more amino acid activating module (cyclosporine synthetase 
has 1 1) in each protein; some peptide synthetases have thioesterase domains (TE), 

20 which can be either integrated into modules or encoded by a separate gene. Each 
synthetase can have type I, type II or both modules. A type I (minimal) module is 
enlarged to show organization of core sequences and domains. Some peptide 
synthetases also have condensation or epimerization domains. Figure 19B 
illustrates the organization of saframycin Mxl synthetase containing 4 amino acid 

25 activating modules (Pospiech et. al. 5 "Two Multifunctional Peptide Synthetases 
and an O-methyltransferase are Involved in the Biosynthesis of the DNA-Binding 
Antibiotic and Antitumour Agent Saframycin Mxl from Myxococcus xanthus" 
Microbiology , 142(4):741-746 (1996)). SafBl from the first module is enlarged. 
Core sequences 1 and 5 in safBl are weakly conserved (indicated by dashed 

30 vertical bars). The remaining domains are typical of type I as shown in Figure 
19 A. SafC is a putative O-methyltransferase. Figure 19C illustrates the 
organization of CPSL Sequence analysis revealed two amino acid activating 
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modules (CPS1A and CPS IB), both of which have high similarity to safBl except 
that core 2 is weakly conserved. A thioesterase domain is found at the C-terminal 
region of CPS IB. Three vertical arrows indicate the positions of targeted gene 
disruptions in the wild type genome that yielded the mutant phenotype. TES1 is a 
5 thioesterase encoded by a separate gene (TES1). 

Figures 20A-F are comparative alignments of core amino acid 
sequences in CPS1A and CPS IB with those of other peptide synthetases. Figures 
20A-E are comparative amino acid sequence alignments of amino-acid-activating 
domains (cores 1-5); Figure 20F is a comparative amino acid sequence alignment 

1 0 of the thioester formation domain (core 6). In each subfigure, the first column 
shows the names of peptide synthetases; the second indicates the position of the 
first residue aligned in the original amino acid sequence of each protein; the last 
column on the right indicates the number of amino acids between two cores 
(Figures 20A-E, in parentheses) or the distance between two adjacent amino-acid- 

1 5 activating modules (Figure 20F, in parentheses). There is an extra column on the 
right in Figure 20F, showing the total number (underlined) of residues in each 
amino-acid-activating module in which the aligned core sequence is located. The 
consensus of each core sequence is on the top, which includes identical or similar 
residues found in all peptide synthetases or with only a few exceptions (active site 

20 is also indicated by asterisks). SafBl : the first module in saframycin Mxl 
synthetase B of Myxococcus xanthus (Genbank accession U24657); GrsA: 
gramicidin S synthetase A of Bacillus brevis (SWISS PROT accession PI 4687); 
HTS1A and HTS1B: the first two modules in HC-toxin synthetase of 
Cochliobolus carbonum (Q01886); EsynA and EsynB: two modules in enniatin 

25 synthetase of Fusarium scirpi (EMBL accession Z18755); ACVA and ACVB: the 
first two modules in ACV synthetase of Aspergillus nidulans (SWISS PROT 
PI 9787); CsynA and CsynB: the first two modules in cyclosporin synthetase of 
Tolypocladium nivenm (EMBL Z28383). 

Figure 21 is a comparative alignment of amino acid sequences of 
30 active sites of thioesterase domains (TE) in CPS1 with those of other peptide 
synthetases. ACV: ACV synthetase (Swiss-PROT accession P19787); GrsB: 
gramicidin S synthetase B (PI 4688); GrsT: the thioesterase encoded by grsT 
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(P 14686) in gramicidin S synthetase gene cluster; SrfA: surfactin synthetase A-3 
(Q08787); TycC: tyrocidine synthetase C (Genbank accession AF004853); TycF: 
the thioesterase encoded by /ycF (AF004853) in the tyrocidine synthetase gene 
cluster. The highly conserved residues (GXSXG) are indicated by asterisks. The 
5 number on the left of each amino acid sequence indicates the original position of 
the first residue; the number on the right (in parentheses) indicates the distance 
between the last residue shown to the end of each protein. 

Figure 22 is a comparative alignment of the amino acid sequence 
of the TESJ protein (CH-TES1) with that of other type II thioesterases. HS-TEII: 

1 0 Homo sapiens thioesterase II (EMBL accession X86032); EC-TESB: E. coli acyl- 
coA thioesterase II (Genbank accession M63308); MT-TESB: Mycobactrium 
tuberculosis homolog to E. coli. acyl-coA thioesterase II (EMBL Z95387). The 
identical residues in all four proteins are in bold. The putative active site VHS 
motif is indicated by asterisks. The numbers on the right column indicate the 

1 5 original position of the last residue of the line in each protein sequence. The 
entire protein sequence of each TES was aligned using the Jotun Hein Method. 
Amino acids corresponding to the positions 142-171, 236-265 and 356-367 in CH- 
TES1 have no significant similarity among the four proteins. 

Figures 23A-B are photographs displaying a plant assay (Figure 
20 23A) and a gel blot (Figure 23B). Targeted gene disruption suggests that CPS1 is 
involved in fungal pathogenesis. In Figure 23A, N-cytoplasm corn was inoculated 
with (left to right): The REMI mutant R.C4.2696; wild type race T; wild type race 
O and five disruptants obtained using the linearized p214B7 as vector. All 
disruptants give the small lesions similar to the original mutant. In Figure 23B, 
25 total genomic DNA was digested with Bglll and probed with both 5' and 3' end 
flanking DNA fragments carried on p214B7 separately (strain order is the same as 
above). A single band (4.2 kb) is present in both wild type race T and race O but 
replaced by a 9.3 kb band (increased by the size of the vector, 5.1 kb) in all strains 
that showed the mutant phenotype in Figure 23A. 

30 Figure 24 displays a gel blot analysis showing targeted integration 

of the chromosome walking vector p214SNP into the wild type genome. Lane 1 : 
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wild type race O. Lanes 2-8: seven transformants obtained by transforming wild 
type race O (strain C5) with p214SNP. Genomic DNAs were digested with Bglll 
and probed with 3.2 kb flanking genomic DNA fragment cloned on p214Sl which 
gives three bands in wild type (4.2, 2.1 and 0.5 kb, indicated by arrows on the 

5 left). The targeted site is in the 2. 1 kb fragment which is missing in all disruptants 
(indicated by an arrow on the right). Other two bands are intact as predicted. All 
disruptants showed the same mutant phenotype in the plant assay as shown in 
Figure 23. Five disruptants (lanes 2, 3, 5, 7 and 8) resulted from a single 
crossover integration. Integration in two other disruptants (lanes 4 and 6) has not 

1 0 been determined. One of the disruptants (#118, lane 2) was used to recover the 
plasmid pi 18B14 and pi 18BC4. 

Figure 25 displays a gel blot analysis showing targeted integration 
of the chromosome walking vector pi 18BSP into the wild type genome. Lane 1 : 
wild type race O; Lane 2: wild type race T; Lanes 3, 4 and 5: three transformants 

1 5 obtained by transforming wild type race O (strain C5) with pi 1 8BSP. Genomic 
DNAs were digested with Pst I (which cuts pUCATPH) and probed with the 3.2 
kb flanking genomic DNA fragment cloned on p214Sl which gives two bands in 
wild type (6.6 and 2.5 kb, indicated by arrows on the left). The targeted site is in 
the 6.6 kb region which is missing in all three disruptants (indicated by arrows on 

20 the right). The 2.5 kb band is intact as predicted. The third band (part of vector 
plus 4.6 kb genomic DNA) does not hybridize to the probe. All disruptants 
showed the same mutant phenotype in the plant assay as shown in Figure 23. Two 
disruptants (lanes 3 and 4) resulted from a double crossover integration. 
Integration in the third disruptant (lane 5) has not been determined. One of the 

25 disruptants (#9, lane 3) was used to recover the plasmid p9P2. 

Figure 26 displays a gel blot showing the detection of CPS1 
homologs in C. victoriae and C. carbonum. Genomic DNAs were digested with 
Bglll and probed with the 3.4 kb CPS1 fragment cloned on p214B7 (Figure 2) 
which includes most of the 4.2 kb Bglll fragment of CPS1 (Figure 15). Lanes 1 
30 and 2: C. heterostrophus race T (C4) and race O (C5), both of which hybridized to 
• the 4.2 kb fragment. Lanes 3: C. victoriae (Hvw). Lanes 4, 5 and 6: C. carbonum 
race 1 (26R13), race 2 (YugY) and race 3 (BZ1209). Note that both C. victoriae 
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and C carbonum (three races) hybridized to a 5.0 kb fragment and the 
hybridization signals of all three species showed about the same intensity. 

Figure 27 is a high-performance liquid chromatography (HPLC) 
profile of culture extracts from wild type C victoriae (HvW, top left) and three 
5 transformants (Tx7, Tx2 and Tx9). The major peak for victorin C is indicated by 
arrowheads. No significant differences in victorin production were detected 
between the wild type and transformants. Three other transformants (Tx4, Tx5 
and Tx8) gave the same results. 

Figure 28A-B are photographs which display a plant assay (Figure 

10 28 A) and a gel blot (Figure 28B), showing targeted disruption of the CPS! 

homolog in C victoriae. Figure 28A shows C victoriae transformants (Tx) with 
reduced or wild type pathogenicity to susceptible oats. Oat seeds were inoculated 
with conidial suspensions of (left to right) wild type, Tx7, Tx2 and Tx9 (the last 
pot on the right is the uninoculated control). Two transformants (Tx7 and Tx2) 

1 5 showed dramatically reduced pathogenicity as indicated by the substantial growth 
of the oat plants. These two transformants resulted from a homologous 
integration (lanes 2 and 3 in Figure 28B). Tx9, which killed all oat plants as wild 
type, resulted from a ectopic integration (lane 4 in Figure 28B). All three 
transformants produced wild type level of victorin as determined by HPLC 

20 analysis (Figure 27). Figure 28B shows disruption of CPS1 homolog in the wild 
type genome. Genomic DNAs were digested with BgRl and probed with the 3.2 
kb CPS1 fragment (Kpn\-Sac\) cloned on p214Sl (Figure 2) which hybridized to 
two fragments (4.2 and 2.2 kb, see Figure 15) in C heterostrophus (lane 1), but to 
three fragments (5.0, 1 .8 and 0.2 kb) in the C victoriae (lane 2). In Tx7 (lane 3) 

25 and Tx2 (lane 4), one or two of the wild type fragments was replaced by a larger 
fragment (8.0 kb for Tx7 and 9.4 kb for Tx2) containing the transforming vector. 
The actual size increase by the vector integration can not be predicted because the 
presence of polymorphic bands in C victoriae genome and the presence of 
duplicated CPS! fragments on the transforming vector (Figure 5). In Tx9 (lane 

30 5), which caused wild type symptoms shown in Figure 28A, all three wild type 
bands (indicated by arrows) are intact, confirming an ectopic integration. 



SUBSTITUTE SHEET (RULE 26) 



WO 01/38489 



PCT/US00/32227 



-20- 

Figure 29 shows the REMI vector pUCATPH. This vector was 
constructed by insertion of a 2.4 kb Sail fragment containing the selectable marker 
cassette (the largest arrow) from pDH25 (Cullen et al., 1987) into the Sail site of 
polylinker site of pUC18 (between lad and lacZ). Only six-base-pair restriction 

5 enzyme sites are shown on the map. Italicized sites are unique; three of them, 
//mdlll, Kpnl, and Sad (bold) have been used for REMI transformation. Non- 
cutting enzymes are listed below the map (enzymes that recognize six-base-pair 
sites are underlined), amp, Ampicillin resistance gene; hygB, hygromycin B 
resistance gene; PtrpC, A. nidulans trpC promoter; TtrpC, A. nidulans trpC 

1 0 terminator; ori, Escherichia coli origin of replication. 

Figures 30A-30C are photographs of DNA gel blots showing 
DNA-DNA hybridization of ChCPSl to other fungal genera and species. In 
Figure 30A, the gel was loaded with Cochliobolus species (lanes 1-17) as follows: 
C. heterostrophs race T, race O; C. carbonum race 1, race 2; C. victoriae isolates 

15 FI3, HvW; C. bicolor, C. dactyloctenii, C. chloridis, C. homomorphus, C. 

intermedins, C. melinidis, C. melinidis, C. peregianensis, C. perotidis, C. ravenelii 
and C. sativus. Figure 30B is a photograph of a DNA gel blot from a gel loaded 
with other Ascomycete genera (lanes 1-14) as follows: C. carbonum racel 
(control), Setosphaeria rostrata, Stemphyliium spp., Pyrenophora tritici repentis, 

20 Bipolaris sacchari, Alternaria spp., A. solani, Nectria haematococca, Fusarium 
oxysporum, Glomerella spp. Magnaporthe grisea, F. moniliforme, F. moniliforme 
(repeat) and A. solani (repeat). Figure 30C is is a photograph of a DNA gel blot 
from a gel comparing Candida albicans to C. heteroslrophus and closely related 
species (lanes 1-7): C. heterostrophs race T, Bipolaris sacchari, Setosphaeria 

25 rostrata, Stemphyliium spp., Pyrenophora tritici repentis, Alternaria spp. and 
Candida a/6/ca/«(arrowhead). Genomic DNAs were digested with Hindlll (A, 
lanes 1-17; B, lanes 1-1 1; C, lanes 1-7), Xhol (B, lanes 12 and 14) or flg/II (B, 
lane 13) and probed with the 3.2 kb fragment of CPS1 from p214Sl (Fig. 2) at 
high stringency. Weak signals in lanes 3 and 17 (panel A) are due to insufficient 

30 DNA loading (confirmed by a repeat experiment). 

Figure 31 A is a structural comparison of the four, CPS; homologs 
to ChCPSl. ORFs are indicated by the open boxes; shaded boxes inside indicate 
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functional domains; vertical bars indicate conserved motif sequences found in 
nonribosomal peptide synthetases (NRPS) as defined by Stachelhaus and Marahiel 
(Stachelhaus and Marahiel, 1995; Marahiel, 1997) (dashed bars indicate weak 
conservation). The black bulbs indicate the position of putative introns. Cores 1-5: 

5 adenylation; core 6: thioaltion; TE: thioesterase. The distance between core 
sequences is not drawn in exact scale. The name of proteins is on the left of the 
ORF boxes and the number of amino acids on the right. The unidentified regions 
of AsCPSl and PtCPSl are indicated by dash-lined boxes. The similarity to 
ChCPSl (in the overlapping region only, see text for details) is given in the 

1 0 parentheses under the protein names in the order: nucleotide identity/ amino acid 
identity/ amino acid similarity. The positions of the ChCPSl amino acid 1040 is 
indicated by the open arrow; the positions 51 1 and 1269 (to the first and the last 
amino acids of AsCPSl and PtCPSl) are indicated by filled triangles. Figure 31B 
is an amino acid alignment of the four CPS1 homologs to ChCPSl . 530 amino 

15 acids aligned to the amino acids 511-1040 of ChCPSl (shown in A) are shown. 
The identical residues are in uppercase and the similar residues in lowercase. 
Consensus of sequences similar to the typical NRPS signature motifs is 
underlined. The putative cyclization domain motif "DXXXXD/ EXXS/ A" is 
underlined. 

20 Figure 32 is the nucleotide sequence of FgCPSl. 6,003 base pairs 

cloned using the plasmid rescue procedure are shown. The amino acid sequence of 
FgCPSl protein is given below the DNA sequence. The position of the start codon 
ATG (bold and underlined) is designated +1 and the open reading frame stops at 
position 5123 (TGA, bold and underlined). A "CT" motif (italicized and 

25 underlined) and two putative "CAAT" boxes (bold with asterisks) are found at 

positions -30, -204 and -302. A putative intron (in lowercase with 5' and 3' splice 
sequences in bold; branch sites underlined) is located at positions 4245-4290. 
Conserved core sequences are shaded and the putative cyclization domain motif 
"DXXXXEXXA" (position 2323-2346) is underlined. A putative polyadenylation 

30 signal "AATAA" at position 51 88 is bold and overlined. 

Figure 33 shows the nucleotide sequence of AsCPSl. 2,369 base 
pairs amplified by PCR are shown. The amino acid sequence of AsCPSl protein is 
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given below the DNA sequence. The sequence is not complete. Two putative 
introns (in lowercase with 5' and 3' splice sequences in bold; branch sites 
underlined) are located at positions 540-584 and 2012-2059. Conserved core 
sequences are shaded and the putative cyclization domain motif "DXXXXDXXS" 
5 (positions 694-720 ) is underlined. The PCR primer binding sites at the 5' and 3' 
end are underlined. 

Figure 34 shows the nucleotide sequence of PtCPSl. 2,320 base 
pairs amplified by PCR are shown. The amino acid sequence of PtCPSl protein is 
given below the DNA sequence. The sequence is not complete. A putative intron 
1 0 (in lowercase with 5' and 3' splice sequences in bold; branch sites underlined) is 
located at positions 540-583. Conserved core sequences are shaded and the 
putative cyclization domain motif "DXXXXDXXS" (positions 693-719) is 
underlined. The PCR primer binding sites at the 5' and 3' end are underlined. 

Figure 35A is a photograph of a DNA gel blot showing that the 
1 5 2.2 kb wild type band (arrowhead) is disrupted in homologous transformants 

TxFgC8-4, -10, -1 1 and -5 but is intact in the ectopic transformant TxFgC8-Hl, - 
H2and-Bl [generated using HindlH (H) or 5g///(B)-digested pFgC8-hygB]. 
Genomic DNAs were digested with Cla\ and probed with pFgC8 which carries a 
1 .0 kb FgCPSl fragment. Figure 35B is a photograph of plants from a virulence 
20 assay showing F graminearum transformants (Tx) with reduced or wild type 
virulence to wheat. Wheat heads were inoculated with conidial suspensions 
(10 4 /ml) of (left to right) wild type, TxFgC8-4, -10, -HI, and -1 1 or with water 
only. Photograph was taken 7 days after inoculation. Note that most spikelets of 
wheat heads inoculated with homologous transformants looked "healthy" in 
25 contrast to those inoculated with ectopic transformants that were completely 
"bleached" (indistinguishable from wild type). 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention generally relates to an isolated DNA 
molecule from a plant pathogen encoding a CPS 1 peptide synthetase. In one 
30 embodiment, the DNA molecule has a nucleotide sequence which hybridizes to a 
DNA molecule having a sequence corresponding to SEQ. ID. No. 2 as follows: 
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TGCCTGCGCC TGTGCTTGTG CCTGTGGAAT 
TACATACAAC ACCATCCCAT CCCGCTTCAC 
CCGCCGCCCA CAACACCATG GCTGCGACCA 
TGGACCACGA GCTCGAGGAG GGCGATATTA 
5 ACCGCCATCC GCCTCTCTGC GTGCGCTAAT 
GTGCTGCTGT CGCAGTATCT AGGGCCTGAC 
CAGCAGAACC CACCCCAACC ATCCAGTGAG 
ATTCCGTCCG GTCCGAGTCC ATCACNGCGA 
CCCGACTCAT ACCATGACGC TTCCGCACAG 

10 AACGCCTCCG CCGCTGCCTC GGGGGGCTCG 
GGCCGTTTTC AAGAGAAGCA GCTGGGCCTG 
CAGCTGTCGC AAGGAAGCGA GACGTTCATT 
TCGCGCGAGC CCACCATGAT GGGCAACTAC 
GATGGCCAAT TTGGCTCTCC GGGAGAGGCC 

15 CAGGGTTATT TTTCCGACTT CACAGGCCAG 
GGACCCAACC GCTACTCGTC GGGAGATGCC 
ATGATGAACC CCAACGATCT CCCCTTGGGC 
CGCGATCTGC CTTTTGACGT TTACGACCCT 
GACAACATTG GCGCTGTCTT GCGTCACCGA 

20 TGGGTCCTTG ACGCAAAAGG CAAAGAGACG 
CGCGCGGAAA AGGTGGCCAA AGTGATTCGG 
GTGGCATTAG TGTACAGGGA TACAGAAATC 
TTCATTGCGG GCGTTGTAGC GGTACCCATC 
CTTCTCCTAA CGACAACTCA AGCTCATCTC 

25 TTTCATCGTG ACATTAGTCA GAACCGTCTG 
ACGAACGAGT TTGGCAGCCA CCACCCCAAG 
CCAGAGGTTG CCTATATTGA GTTCTCGCGT 
CTTAGTCACC GGACTATTAT GCACCAAATG 



GTCGCGGCCC GCTGCTGCAT AGCCTATCTG 60 

CTGCCTTGCC TCCCTCCTCG TGCCACACAT 120 

ACCCCGAGCT GCAGGCCAAA CTGCAGGAGC 180 

CACAAAAAGG GTCCGTACTG CTGCACCACC 24 0 

CAGTCGCATA GCTATGAAAA ACGTCGCACC 300 

TTTGCTGCCC AGTTGCAGGC CGACCTGAAC 360 

GGCTCTCGCT CCCGCACCGC ATCCTTTGCT 420 

CCACAACCCC CACATATCCA GCTCCCCCGC 480 

GGCCAATTGG GCGCACCCAT GCCATATGCG 540 

CAGTACATGG CATACCCGCC CAGCCAAGTC 600 

CGTACAAATT CGCTCCAGCG CAATTCCTCA 660 

CCACGGCCTC AAACGCCTGA ATACAACCAC 720 

GCCTTCAATC CAGACAATCA GCAAAGTTAT 780 

AGTCGAAGGA GCACCATGCT CGAGGTAAAC 84 0 

CAGATGCAAG ACAATCGCGA CTCGTATGGG 900 

TTTTCTCCTA CCGCCGCGAT TCCACCTCCC 960 

GCTGCTGAAA CCATGATGCC GCTAGAGCCC 1020 

CACAACCCCA ATGTCAAAAT GTCAAAGTTT 1080 

AGTCGCACAC AG CCAAGGAC GACTGCCTTC 1140 

GCGTCCATCA CCTGGGAAAA GGTGGCTAGT 1200 

GACAAGAGCA ACCTCTATCG AGGCGACCGT 1260 

ATTGATTTTG TCGTGGCGTT GATGGGCTGC 1320 

AATAGCGTCG ACGACTACCA GAAACTCATT 1380 

GCATTGACCA CAGACAACAA TCTCAAGGCC 144 0 

AAATGGCCGA GTGGGGTAGA GTGGTGGAAG 1500 

AAACATGACG ATACTCCAGC TTTGCAAGTA 1560 

GCACCTACTG GTGACCTTCG CGGTGTGGTG 1620 

GCCTGCATCA GTGCCATGAT TAG CACG ATA 1680 
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CCCACCAACG CTCAGAGCCA AGACACGTTC AGCACTAGCC TACGGGATGC AGAGGGAAAG 1740 
TTCGTTGCTC CAGCACCGTC CAGAAACCCC ACAGAAGTGA TCCTCACGTA CCTCGACCCG 1800 
CGCGAAAGCG CTGGTCTCAT TCTCAGTGTC TTGTTTGCAG TTTATGGAGG CCACACCACC I860 
GTATGGCTCG AGACAGCGAC CATGGAAACC CCGGGTCTAT ATGCACATCT CATCACCAAA 1920 
5 TACAAGTCCA ACATACTGCT AGCGGATTAC CCAGGCCTCA AGCGCGCTGC ATACAACTAC 1980 
CAACAGGATC CAATGGCTAC AAGAAACTTC AAGAAAAACA CAGAACCCAA CTTCGCCTCC 2040 
GTGAAGATCT GTCTGATTGA CACGCTTACC GTCGACTGTG AATTTCACGA AATTCTCGGA 2100 
GATCGATATT TCAGGCCACT GCGAAACCCT AGAGCGCGAG AACTGATCGC GCCAATGCTC 2160 
TGCTTGCCAG AACATGGTGG AATGATAATA TCTGTACGCG ACTGGCTAGG TGGAGAGGAG 2220 
10 CGCATGGGCT GCCCGCTAAG CATAGCAGTA GAAGAGTCAG ATAATGATGA AGATGATACA 2280 
GAGGATAAGT ATGCAGCGGC AAATGGCTAC TCCAGTCTTA TTGGTGGTGG CACTACAAAG 2340 
AACAAAAAGG AGAAGAAGAA GAAAGGCCCG ACAGAGCTTA CAGAAATCTT GCTGGACAAG 24 00 
GAAGCTCTGA AGATGAACGA AGTCATTGTT CTGGCCATTG GAGAAGAAGC AAGCAAGCGG 2460 
GCAAACGAGC CCGGCACCAT GCGAGTCGGT GCCTTTGGAT ACCCCATACC GGATGCGACA 2520 
15 CTAGCTATTG TAGACCCTGA GACAAGTCTT CTATGTTCAC CATACTCGAT AGGCGAGATC 2580 
TGGGTAGATT CGCCTTCACT CTCTGGTGGC TTCTGGCAGC TGCAGAAGCA TACAGAGACC 2640 
ATTTTCCATG CTCGACCATA CCGTTTCGTT GANGGTAGCC CTACGCCACA GTTGCTTGAA 2700 
CTCGAGTTTC TGCGTACTGG ACTCCTCGGC TTTGTTGTAG ANGGAAAAAT ATTTGTCCTT 2760 
GGACTGTACG AAGATCGCAT CAGACAGCGT GTTGAATGGG TAGAAAATGG TCAGCTTGAA 2 820 
20 GCCGAGCATC GATACTTTTT TGTGCAGCAC CTGGTCACAA GCATTATGAA GGCCGTGCCA 2880 
AAAATTTACG ACTGGTAAGT GAGCTGCCAA CAGAGCAAGG ACTGTCTAAC GTGTCATAGC 2940 
TCGTCGTTTG ATTCTTATGT AAATGGTGAA TACCTGCCAA TCATTCTCAT CGAGACGCAG 3000 
GCCGCATCGA CTGCGCCCAC AAACCCAGGT GGACCACCAC AACAATTGGA TATACCATTT 3060 
TTGGATTCAC TATCTGAGAG GTGCATGGAG GTCCTTTACC AAGAGCATCA TTTACGGGTA 3120 
25 TACTGCGTGA TGATTACAGC ACCTAATACA CTTCCACGAG TCATCAAGAA CGGACGGCGA 3180 
GAAATTGGCA ATATGCTGTG TANGAGANAG TTTGACAATG GCTCTCTGCC CTGTGTNCAC 3240 
GTNAAGTTTG GCATTGAGCG ATCAGTGCAG AACATTGCGC TCGGTGACGA TCCCGCTGGC 3300 
GGCATGTGGT CATTTGAGGC ATCAATGGCA CGTCAGCAAT TCTTGATGCT CCAAGACAAG 3360 
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CAATACTCTG GTGTCGATCA TCGCGAAGTC 
AATCAGTTCT CGAATATCCA CGACCTGATG 
CTTGCTTACT GCACTGTCGA CGGTCGAGGA 
TTTGATCAAA AGGTTGCGGG CGTAGCAATG 
5 GGCGATCATC TCCTTCTGAT GTACACGCAT 
TGTTTTGTGC TTGGAGCTGT TTGCATACCA 
GAGGATGCGC CGGCCTTGCT GCATATCCTT 
AACGCTGACG TTGACCATCT GATGAAGATC 
GCCGCTATCC TCAAGATCAG TGTGCCAAAC 

10 TCCAGTGGCT GCCGCGACCT CAAGCTTACA 
CCAGTGCTAG TCTGGACATA CTGGACGCCC 
CATAGCCAAA TCATGGCACT GTGCAAGGTC 
CGACCAGTCC TTGGTTGTGT CCGGAGCACG 
ATGGGAATCT TCCTTGCCGC ACCCACATAC 

15 CCTAATATTC TGTTCCAAAC GCTTTCGCGG 
CAAATGTTGG ACCACGCCAT CGCACGCGGA 
AAGAATCTCA TGATTGCGAC TGATGGAAGA 
GTATGAGAGG ACTTTCATGA TTGCTAACTC 
GCGCCAGCCA ACTTAGACCC AACCGCAATC 

20 ATGGTAGCAT CACGATCATA CATGTGTATT 
GCTCTGCGAC GCGGCCTCGT CATGCCCGTT 
GTCCAAGACT CGGGCATGGT GCCAGTGAGC 
AACCAACTGT GCTTGAACGG CGAGTACGGC 
TATAGCTTCT ACATGTCGAA AGAGCGCTTG 

25 GACGGAGACC CAAATGTGCG ATATGTTCGT 
ACACGGCCCA TTGGACCCAA CGGTGCACCT 
AGCATAGGTG ACACTTTTGA AGTCAACGGA 
TCTGTTGAAC GTTGTCACCG GAATATTGTC 
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GTCATTGACG ACAGGACATC GACTCCACTC 3420 

CAATGGCGTG TATCTCGGCA GGCCGAGGAA 3480 

AAAGAGGGCA AAGGCGTCAA TTGGAAGAAG 354 0 

TACCTCAAGA ACAAGGTCAA GGTCCAGGCC 3600 

TCAGAAGAAT TTGTTTATGC TGTTCATGCA 3660 

ATGGCGCCAA TTGATCAGAA CCGGTTGAAT 3720 

GCAGATTTCA AGGTCAAAGC CATTCTTGTC 3780 

AAGCAAGTAT CGCAGCACAT CAAACAATCG 384 0 

ACATACAGCA CAACAAAGCC GCCAAAGCAA 3900 

ATTCGACCGG CATGGATTCA GGCGGGTTTC 3 960 

GATCAACGTC GTATCGCAGT TCAGCTGGGC 4 020 

CAAAAAGAAA CATGCCAAAT GACAAGTACA 4080 

ATAGGACTTG GTTTCCTTCA CACTTGTCTC 414 0 

CTGGTGTCAC CTGTTGACTT TGCACAAAAC 4200 

TACAAGATCA AGGATG CATA TGCAACGAGT 4260 

GCTGGTAAGA GTATGGCTCT GCACGAGCTG 4320 

CCACGCGTTG ATGTTTGTAA GTGAACATTT 4 3 80 

AATGCAGACC AAAGAGTGCG TGTGCACTTT 4440 

AACACTGTCT ACTCACATGT ATTGAACCCA 4 500 

GAGCCAGTCG AGCTCCATCT CGATGTGCAT 4560 

GACCCTGACA CAGAGCCCAA CGCTTTGCTC 4620 

ACG CAAAT AT CCATTGTCAA CCCAGAGACC 4680 

GAGATCTGGG TGCAGTCCGA GGCGAATGCT 474 0 

GATGCAGAAC GCTTCAATGG GAGGACGATT 4 800 

ACAGGCGATT TAGGATTTTT GCACAGCGTG 4860 

GTTGATATGC AGGTGCTTTT CGTGCTTGGA 4920 

CTGAACCATT TCTCTATGGA CATTGAGCAG 4980 

CCTGGAGGCT GGTACGTTTC TTCGATTCGC 504 0 
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TGTTATTTAG TAAATACTTA CTAACACTCT ACAGTGCTGT TTTCCAGGCA GGTGGGCTTG 5100 

TTGTTGTCGT TGTGGAAATC TTCCGACGCA ACTTCCTCGC AAGCATGGTG CCTGTGATTG 5160 

TCAATGCAAT TTTGAACGAG CATCAGCTGG TCATTGACAT TGTCTCGTTT GTGCAAAAGG 5220 

GCGACTTCCA CCGGTCTCGT CTGGGCGAGA AGCAACGCGG AAAGATTCTT GCAGGATGGG 5280 

5 TCACACGGAA GATGCGCACA ATAGCCCAGT ACAGTATACG GGATCCTAAT GGACAGGATT 5340 

CCCAGATGAT CACGGAAGAG CCTGGTCCAC GGGCTAGATG ACTGGAAGTA TGCTTGGGCG 5400 

AATGGGCGGC CCAGCCAGTA TCAAGGCCGG GTCGACAAGA GCACCGAGTC TAATGGGCAT 5460 

GACAGCGACT ATGAATAATC TATCCCTTAC ACAG CAGCAA CAGCAGCAAT ACCAACAGCC 5520 

GGGTATGTAT GCTCAACAGC AAGGCATGCA CCCCCAGCAA CAACACCAAT TTAGCATGTC 5580 

10 CAACACGCCA CCACAAGGTC CACCCCAAGG CGTAGAACTA CATGATCCTA GCGACCGCAC 564 0 

ACCAACAGAC AACCGGCACT CTTTCCTTGC CGACCCGCGT ATGCAGAACC AGGG CCAAAT 570 0 

GAACGAGACG GGCGCCTACG AACCCATGAA CTATCAAAAC GCGTATCATC CGCATCAACA 5760 

ACAATACGAA TCTGAAGACG GGGGGAGCAG ACTCAGCGGC CCCGTGCCAG ACGTGCTGCG 5820 

GCCGGGTCCT TCATCCGGGT CCATAGAGCA GCACGACCAA GCTAACAACG ACAACAATAT 5880 

15 GTGGAATAAT CGCGAGTACT ATGGTAACAG CCCATCGTAT GCAGGCGGAT ACACGCAAGA 594 0 

TGGCAATATC CACGAGCAGC AACAACACGA TGAGTACACG AGTAATG CGT CATATGGCGG 6000 

AAATCAAGGA GCAGGCGGAG GCAGCGGCGG CGGTGGCGGT CTCCGAGTTG CAAATCGTGA 6060 

CAGCTCCGAC AGCGAGGGTG CAGATGACGA CGCTTGGAGA CGTGATGCCC TTGCTCAGAT 6120 

CAATTTTGCG GGCGGCGCTG CTGCTGCCTC CGCTGGAGCA CCTGCTGCTG GTGCTTCTTC 6180 

20 TTCGCAGCCG GGCCATGCGC AGTAGACGGG ATATGCGTGA GTTTTTTTTT AAATTTCGTA 624 0 

CATAGAGACC GTTGTATACG CAGGTTTCAA ATTAGAAGAG CGAATATGCA TATCAGCTGT 6300 

TGTTCAATGT TCTAGTTTGG GAAGGTTAAC CCCCCCCCCT TCCCCTTCCA AGACTTTTCA 6360 

CTTGTTTGTG TGTGATTTAA ATCTGGAGAT TTCAAATCTA CATCTCGCTA TACATAGGTG 6420 

TTGTTTGATA ACGTAGGGGG CAGAAGGGTA TCTCGTGATA TTAGACTGGG AGTTGCATGA 64 80 

25 ATCAAGGTGT TGAGCAAAAA AAGAGAGAGC GGTGAAGGGC GGGGGGGATA GGTGGTGTGC 6540 

ACGTGGCTG 6549 

In a preferred embodiment, the plant pathogen is Cochliobolus 
heterostrophus. In another preferred embodiment, the plant pathogen is 
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Cochliobolus carbonum, C victoriae, C sativus, C specifer, C. homomorphus. C 
dactyloctenii, Setosphaeria turcica, S. rostrata, or Bioplar is sacchari. 

The peptide synthetase of SEQ. ID. No. 2 has a deduced amino 
acid sequence corresponding to SEQ. ID. No. 3 as follows: 

5 Met Leu Glu Val Asn Gin Gly Tyr Phe Ser Asp Phe Thr Gly Gin 
Gin Met Gin Asp Asn Arg Asp Ser Tyr Gly Gly Pro Asn Arg Tyr 
Ser Ser Gly Asp Ala Phe Ser Pro Thr Ala Ala He Pro Pro Pro 
Met Met Asn Pro Asn Asp Leu Pro Leu Gly Ala Ala Glu Thr Met 
Met Pro Leu Glu Pro Arg Asp Leu Pro Phe Asp Val Tyr Asp Pro 
10 His Asn Pro Asn Val Lys Met Ser Lys Phe Asp Asn He Gly Ala 

Val Leu Arg His Arg Ser Arg Thr Gin Pro Arg Thr Thr Ala Phe 105 
Trp Val Leu Asp Ala Lys Gly Lys Glu Thr Ala Ser He Thr Trp 
Glu Lys Val Ala Ser Arg Ala Glu Lys Val Ala Lys Val He Arg 
Asp Lys Ser Asn Leu Tyr Arg Gly Asp Arg Val Ala Leu Val Tyr 
15 Arg Asp Thr Glu lie He Asp Phe Val Val Ala Leu Met Gly Cys 
Phe He Ala Gly Val Val Ala Val Pro lie Asn Ser Val Asp Asp 
Tyr Gin Lys Leu lie Leu Leu Leu Thr Thr Thr Gin Ala His Leu 
Ala Leu Thr Thr Asp Asn Asn Leu Lys Ala Phe His Arg Asp lie 
Ser Gin Asn Arg Leu Lys Trp Pro Ser Gly Val Glu Trp Trp Lys 
20 Thr Asn Glu Phe Gly Ser His His Pro Lys Lys His Asp Asp Thr 
Pro Ala Leu Gin Val Pro Glu Val Ala Tyr He Glu Phe Ser Arg 
Ala Pro Thr Gly Asp Leu Arg Gly Val Val Leu Ser His Arg Thr 
lie Met His Gin Met Ala Cys lie Ser Ala Met lie Ser Thr lie 
Pro Thr Asn Ala Gin Ser Gin Asp Thr Phe Ser Thr Ser Leu Arg 
25 Asp Ala Glu Gly Lys Phe Val Ala Pro Ala Pro Ser Arg Asn Pro 315 
Thr Glu Val He Leu Thr Tyr Leu Asp Pro Arg Glu Ser Ala Gly 
Leu lie Leu Ser Val Leu Phe Ala Val Tyr Gly Gly His Thr Thr 
Val Trp Leu Glu Thr Ala Thr Met Glu Thr Pro Gly Leu Tyr Ala 
His Leu lie Thr Lys Tyr Lys Ser Asn lie Leu Leu Ala Asp Tyr 



15 
30 
45 
60 
75 
90 



120 
135 
150 
165 
180 
195 
210 
225 
240 
255 
270 
285 
300 



330 
345 
360 
375 
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Pro Gly Leu Lys Arg Ala Ala Tyr 
Ala Thr Arg Asn Phe Lys Lys Asn 
Val Lys lie Cys Leu He Asp Thr 
His Glu He Leu Gly Asp Arg Tyr 
5 Arg Ala Arg Glu Leu He Ala Pro 
Gly Gly Met He He Ser Val Arg 
Arg Met Gly Cys Pro Leu Ser He 
Asp Glu Asp Asp Thr Glu Asp Lys 
Ser Ser Leu He Gly Gly Gly Thr 

10 Lys Lys Lys Gly Pro Thr Glu Leu 
Glu Ala Leu Lys Met Asn Glu Val 
Glu Ala Ser Lys Arg Ala Asn Glu 
Ala Phe Gly Tyr Pro He Pro Asp 
Pro Glu Thr Ser Leu Leu Cys Ser 

15 Trp Val Asp Ser Pro Ser Leu Ser 
Lys His Thr Glu Thr He Phe His 
Xaa Gly Ser Pro Thr Pro Gin Leu 
Thr Gly Leu Leu Gly Phe Val Val 
Gly Leu Tyr Glu Asp Arg He Arg 

20 Asn Gly Gin Leu Glu Ala Glu His 
Leu Val Thr Ser He Met Lys Ala 
Ser Ser Phe Asp Ser Tyr Val Asn 
Leu He Glu Thr Gin Ala Ala Ser 
Gly Pro Pro Gin Gin Leu Asp He 

25 Glu Arg Cys Met Glu Val Leu Tyr 
Tyr Cys Val Met He Thr Ala Pro 
Lys Asn Gly Arg Arg Glu He Gly 
Phe Asp Asn Gly Ser Leu Pro Cys 
Glu Arg Ser Val Gin Asn He Ala 

30 Gly Met Trp Ser Phe Glu Ala Ser 
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Asn Tyr 


Gin 


Gin Asp 


Pro 


Met 


390 


Thr Glu 


Pro 


Asn 


Phe 


Ala 


Ser 


405 


Leu Thr 


Val 


Asp 


Cys 


Glu 


Phe 


420 


Phe Arg 


Pro 


Leu 


Arg 


Asn 


Pro 


435 




Pvs 


Leu 


Pro 


Glu 


His 


450 


Aon TtTI 
nop 11^ 


Leu 


Gly Gly Glu 


Glu 


465 


Ala Val 


Glu 


Glu 


Ser 


Asp 


Asn 


480 


Tyr Ala 


Ala 


Ala 


Asn Gly 


Tvr 
j 


495 


Thr Lys 


Asn 


Lys 


Lys 


Glu 


LVS 


510 


Thr Glu 


He 


Leu 


Leu 


Asp 


Lvs 


525 


He Val 


Leu 


Ala 


He 


Gly 


Glu 


540 


Pro Gly Thr 


Met 


Arg Val 


Glv 


555 


Ala Thr 


Leu 


Ala 


He 


Val 


Asp 


570 


Pro Tyr 


Ser 


He 


Gly 


Glu 


He 


585 


Gly Gly 


Phe 


Trp 


Gin 


Leu 


Gin 


600 


Ala Arg 


Pro 


Tyr 


Arg 


Phe 


Val 


615 


Leu Glu 


Leu 


Glu 


Phe 


Leu 


Arci 


630 


Glu Gly Lys 


He 


Phe 


Val 


Leu 


645 


Gin Arg Val 


Glu 


Trp 


Val 


Glu 


660 


Arg Tyr 


Phe 


Phe 


Val 


Gin 


His 


675 


Val Pro 


Lys 


He 


Tyr 


Asp 


Cvs 


690 


Gly Glu Tyr 


Leu 


Pro 


lie 


He 


705 


Thr Ala 


Pro 


Thr 


Asn 


Pro 


Gly 


720 


Pro Phe 


Leu 


Asp 


Ser 


Leu 


Ser 


735 


Gin Glu 


His 


His 


Leu Arg 


Val 


750 


Asn Thr 


Leu 


Pro 


Arg 


Val 


He 


765 


Asn Met 


Leu 


Cys 


Arg Arg 


Glu 


780 


Val His 


Val 


Lys 


Phe 


Gly 


He 


795 


Leu Gly Asp 


Asp 


Pro 


Ala 


Gly 


810 


Met Ala 


Arg 


Gin 


Gin 


Phe 


Leu 


825 
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Met Leu Gin Asp Lys Gin Tyr Ser 
Val He Asp Asp Arg Thr Ser Thr 
He His Asp Leu Met Gin Trp Arg 
Leu Ala Tyr Cys Thr Val Asp Gly 
5 Val Asn Trp Lys Lys Phe Asp Gin 
Tyr Leu Lys Asn Lys Val Lys Val 
Leu Met Tyr Thr His Ser Glu Glu 
Cys Phe Val Leu Gly Ala Val Cys 
Gin Asn Arg Leu Asn Glu Asp Ala 
10 Ala Asp Phe Lys Val Lys Ala He 
His Leu Met Lys He Lys Gin Val 
Ala Ala He Leu Lys He Ser Val 
Lys Pro Pro Lys Gin Ser Ser Gly 
He Arg Pro Ala Trp He Gin Ala 
15 Thr Tyr Trp Thr Pro Asp Gin Arg 
His Ser Gin He Met Ala Leu Cys 
Gin Met Thr Ser Thr Arg Pro Val 
He Gly Leu Gly Phe Leu His Thr 
Ala Ala Pro Thr Tyr Leu Val Ser 
20 Pro Asn He Leu Phe Gin Thr Leu 
Ala Tyr Ala Thr Ser Gin Met Leu 
Ala Gly Lys Ser Met Ala Leu His 
Ala Thr Asp Gly Arg Pro Arg Val 
Val His Phe Ala Pro Ala Asn Leu 
25 Val Tyr Ser His Val Leu Asn Pro 
Met Cys He Glu Pro Val Glu Leu 
Arg Arg Gly Leu Val Met Pro Val 
Ala Leu Leu Val Gin Asp Ser Gly 
He Ser He Val Asn Pro Glu Thr 
30 Glu Tyr Gly Glu He Trp Val Gin 
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Gly Val Asp His Arg Glu Val 840 

Pro Leu Asn Gin Phe Ser Asn 855 

Val Ser Arg Gin Ala Glu Glu 870 

Arg Gly Lys Glu Gly Lys Gly 885 

Lys Val Ala Gly Val Ala Met 900 

Gin Ala Gly Asp His Leu Leu 915 

Phe Val Tyr Ala Val His Ala 930 

He Pro Met Ala Pro He Asp 945 

Pro Ala Leu Leu His He Leu 960 

Leu Val Asn Ala Asp Val Asp 975 

Ser Gin His He Lys Gin Ser 990 

Pro Asn Thr Tyr Ser Thr Thr 1005 

Cys Arg Asp Leu Lys Leu Thr 1020 

Gly Phe Pro Val Leu Val Trp 1035 

Arg He Ala Val Gin Leu Gly 1050 

Lys Val Gin Lys Glu Thr Cys 1065 

Leu Gly Cys Val Arg Ser Thr 1080 

Cys Leu Met Gly He Phe Leu 1095 

Pro Val Asp Phe Ala Gin Asn 1110 

Ser Arg Tyr Lys He Lys Asp 1125 

Asp His Ala He Ala Arg Gly 1140 

Glu Leu Lys Asn Leu Met He 1155 

Asp Val Tyr Gin Arg Val Arg 1170 

Asp Pro Thr Ala He Asn Thr 1185 

Met Val Ala Ser Arg Ser Tyr 1200 

His Leu Asp Val His Ala Leu 1215 

Asp Pro Asp Thr Glu Pro Asn 1230 

Met Val Pro Val Ser Thr Gin 1245 

Asn Gin Leu Cys Leu Asn Gly 1260 

Ser Glu Ala Asn Ala Tyr Ser 127 5 
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Phe Tyr Met Ser Lys Glu Arg Leu 
Arg Thr lie Asp Gly Asp Pro Asn 
Asp Leu Gly Phe Leu His Ser Val 
Gly Ala Pro Val Asp Met Gin Val 
5 Gly Asp Thr Phe Glu Val Asn Gly 
lie Glu Gin Ser Val Glu Arg Cys 
Gly Cys Ala Val Phe Gin Ala Gly 
Glu lie Phe Arg Arg Asn Phe Leu 
Val Asn Ala He Leu Asn Glu His 
10 Ser Phe Val Gin Lys Gly Asp Phe 
Lys Gin Arg Gly Lys He Leu Ala 
Arg Thr He Ala Gin Tyr Ser He 
Ser Gin Met He Thr Glu Glu Pro 
Gly Ser Met Leu Gly Arg Met Gly 
15 Gly Ser Thr Arg Ala Pro Ser Leu 
Asn Asn Leu Ser Leu Thr Gin Gin 
Pro Gly Met Tyr Ala Gin Gin Gin 
His Gin Phe Ser Met Ser Asn Thr 
Gly Val Glu Leu His Asp Pro Ser 
20 Arg His Ser Phe Leu Ala Asp Pro 
Met Asn Glu Thr Gly Ala Tyr Glu 
Tyr His Pro His Gin Gin Gin Tyr 
Arg Leu Ser Gly Pro Val Pro Asp 
Ser Gly Ser He Glu Gin His Asp 
25 Met Trp Asn Asn Arg Glu Tyr Tyr 
Gly Gly Tyr Thr Gin Asp Gly Asn 
Asp Glu Tyr Thr Ser Asn Ala Ser 
Gly Gly Gly Ser Gly Gly Gly Gly 
Asp Ser Ser Asp Ser Glu Gly Ala 
30 Asp Ala Leu Ala Gin He Asn Phe 
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Asp Ala Glu Arg Phe Asn Gly 1290 

Val Arg Tyr Val Arg Thr Gly 1305 

Thr Arg Pro He Gly Pro Asn 1320 

Leu Phe Val Leu Gly Ser He 1335 

Leu Asn His Phe Ser Met Asp 1350 

His Arg Asn He Val Pro Gly 1365 

Gly Leu Val Val Val Val Val 1380 

Ala Ser Met Val Pro Val He 1395 

Gin Leu Val He Asp He Val 1410 

His Arg Ser Arg Leu Gly Glu 1425 

Gly Trp Val Thr Arg Lys Met 144 0 

Arg Asp Pro Asn Gly Gin Asp 14 55 

Gly Pro Arg Ala Ser Met Thr 1470 

Gly Pro Ala Ser He Lys Ala 1485 

Met Gly Met Thr Ala Thr Met 1500 

Gin Gin Gin Gin Tyr Gin Gin 1515 

Gly Met His Pro Gin Gin Gin 1530 

Pro Pro Gin Gly Pro Pro Gin 154 5 

Asp Arg Thr Pro Thr Asp Asn 1560 

Arg Met Gin Asn Gin Gly Gin 1575 

Pro Met Asn Tyr Gin Asn Ala 1590 

Glu Ser Glu Asp Gly Gly Ser 1605 

Val Leu Arg Pro Gly Pro Ser 1620 

Gin Ala Asn Asn Asp Asn Asn 1635 

Gly Asn Ser Pro Ser Tyr Ala 1650 

He His Glu Gin Gin Gin His 1665 

Tyr Gly Gly Asn Gin Gly Ala 1680 

Gly Leu Arg Val Ala Asn Arg 1695 

Asp Asp Asp Ala Trp Arg Arg 1710 

Ala Gly Gly Ala Ala Ala Ala 1725 
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Ser Ala Gly Ala Pro Ala Ala Gly Ala Ser Ser Ser Gin Pro Gly 174 

. 174 
His Ala Gin 

In another embodiment of the invention, there is provided an 
isolated nucleic acid molecule encoding a CPS1 peptide synthetase homolog and 
5 which hybridizes to a nucleic acid molecule having a sequence corresponding to 
SEQIDNo. 41 as follows: 





AAGAAGAAAG GGCCGACCGA GTTGACCGAA ATATTGCTAG 


ATAAGGAAGC 


ACTGAAGCTG 


60 




AACGAAGTTG TTGTTTTGGC CATTGGAGAG GAAGTGAGCA 


AGCGTGTCAA 


CGAACCCGGC 


120 


ACTATGAGAG TCGGTGCTTT TGGCTACCCG ATACCAGATG 


CGACGCTGGC 


CGTCGTCGAT 


180 




CCGGAAACTA ATCTTTTGTG TTCACCCTAT TCCATAGGAG 


AG ATCTGGG T 


AGACTCGCCA 


240 


15 


TCATTGTCCG GAGGGTTTTG GCAGCTGCAG AAGCACACTG 


AGACTATTTT 


CCACGCTCGG 


300 




CCATATCGTT TCGTAGAGGG CAGCCCAACC CCGCAACTAC 


TCGAACTGGA 


GTTTCTACGC 


360 


on 
ZU 


ACTGGACTGC TCGGATGCGT GGTAGAAGGC AAAATCTTCG 


TATTAGGCCT 


GTACGAGGAC 


420 


CGGATTAGGC AGCGCGTTGA ATGGGTAGAG CACGGTCAGC TAGAAGCCGA ACATAGGTAT 


480 




TTCTTCGTGC AGCATCTTGT CACCAGCATT ATGAAAGCTG 


TTCCAAAGAT 


TTACGACTGG 


540 


25 


TAAGTGCTAT CGAATCTCTG GGTAATCAAC CTAACATTGC 


GCAGCTCGTC 


TTTCGATTCC 


600 




TATGTCAACG GCGAATACTT ACCAATCATC CTTATCGAGA 


CACAGGCCGC 


ATCAACTGCT 


660 


jU 


CCCACAAATC CAGGCGGGCC ACCACAACAA CTTGACATTC 


CTTTCCTAGA 


CTCTCTTTCT 


720 


GAGCGATGTA TGGAGGTACT GTAT CAAG AA CACCACCTTC 


GGGTGTATTG 


TGTGATGATC 


780 




ACTGCACCGA ACACACTCCC GCGAGTCATC AAGAACGGTC 


GACGAGAAAT 


TGGAAACATG 


840 


35 


CTTTGCCGGA GAGAATTTGA CAATGGCTCG CTACCCTGCG 


TTCACGTCAA 


GTTTGGCGTC 


900 




GAGAGGTCGG TCCAGAATAT TGCGCTAGGT GATGACCCTG 


CTGGCGGCAT 


GTGGTCTTAC 


960 


40 


GAGGCGTCGA TGGCACGCCA GCAGTTCCTG ATGCTTCAAG 


ATAAGCAGTA 


CTCTGGAGTA 


1020 


GATCACAGAG AAGTCGTTAT TGACGACAGA ACGTCGACGC CGCTCAACCA GTTCTCCAAC 


1080 




ATTCATGACC TTATGCAATG GCGCGTACAA CGACAAGCTG 


AAGAGCTCGC 


CTACTGCACG 


1140 


45 


GTAGATGGTC GAGGTAAAGA GGGCAAAGGC GTCAACTGGA 


AGAAGTTCGA 


CCAGAAGGTC 


1200 




GCAGGTGTCG CCATGTACCT GAAGAACAAG GTCAAGGGTC AGACTGGTGA 


CCACCTGCTC 


1260 


50 


TTGATGTACA CCCACTCGGA AGACTTTGTC TATGCCGTAC 


ACGCGTGTTT 


CGTCCTTGGA 


1320 


GCTGTGTGTA TACCCATGGC ACCAATCGAC CAGAACAGGC 


TAAATGAAGA 


CGCGCCCGCA 


1380 




CTACTACATA TCATTGCTGA CTTCAAGGTC AAGGCTATCC 


TCGTCAATGC 


TGGCGTAGAC 


1440 


55 


CACCTGATGA AGGTCAAGCA AGTATCGCAG CACATCAAAC 


AGTCAGCAGT 


CATTCTCAAG 


1500 




ATCAACGTAC CGAATACCTA TAACACCACA AAACCACCTA 


AGCAGTCTAG 


TGGTTGCCGC 


1560 
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GATCTTAAGC 


TCACAATACG 


ACCTGCTTGG 


ATACAATCTG 


GTTTCCCTGT 


TCTAGTATGG 


1620 


ACATACTGGA 


CACCTGACCA 


GAGACGCATA 


GCTGTGCAAT 


TAGGTCATAG 


CCAAATCATG 


1680 


GCGCTATGCA 


AAGTTCAGAA 


AGAAACGTGC 


CAGATGACGA 


GCACACGGCC 


CGTCCTTGGA 


1740 


TGTGTTCGTA 


GCACGATCGG 


TCTTGGCTTC 


ATACACACCT 


GTGTTATGGG 


TATCTTCCTC 


1800 


GCAGCGCCAA 


CTTACCTTGT 


GTCACCTGTC 


GATTTTGCGC 


AAAACCCGAA 


CATCCTCTTC 


1860 


CAGACCATGT 


CGAGATACAA 


GATCAAGGAC 


GCGTATGCGA 


CCAGCCAAAT 


GCTGGACCAC 


1920 


GCTATTGCAC 


GAGGTGCTGG 


CAAGAACATG 


GCTCTGCACG 


AGCTCAAGAA 


CCTCATGATC 


1980 


GCGACTGACG 


GTCGGCCGCG 


CGTAGACGTC 


TGTAAGTGTT 


GCGATCCTGT 


ATAAGCATCT 


2040 


GAAATCTAAT 


TCTTGATAGA 


CCAGCGTGTG 


CGAGTACACT 


TCTCGCCAGC 


AAGTTTGGAC 


2100 


CGAACGGCAA 


TCAATACTGT 


TTACTCACAC 


GTACTGAATC 


CTATGGTCGC 


ATCGCGGTCA 


2160 


TACATGTGCA 


TCGAACCCAT 


AGAACTACAT 


CTCGATGTCG 


GTGCCCTTCG 


AAGAGGTCTC 


2220 


ATCATGCCTG 


TCGACCCAGA 


CACGGAACCT 


GGTGCTCTCT 


TAGTCCAGGA 


CTCGGGTATG 


2280 


GTACCAGTTA 


GTACACAAAT 


TTCAATCGTG 


AATCCAGAGA 


CAAACCAGCT 


TTGCCTAGTC 


2340 


GGCGAGTATG 


GCGAAATCTG 


GGTCCAACC 








2370 



10 



15 



20 



25 



Preferably, the CPS1 nucleic acid molecule which hybridizes to a 
30 nucleic acid molecule having a sequence as set forth in SEQ ID No. 4 1 is from the 

plant pathogen Altenaria solanL In another preferred embodiment, the CPS1 gene 

is from a plant pathogen such as Alternaia alternatherae, A. alternata, A. 

amaranthi, A. araliae, A. brassicae, A, brassicicola, A. camelliae, A, cassiae, A. 

cheiranthi, A. cinerariae, A. gossypii, A. helianthi, A. helianthinfwiens, A. mali, 
35 or A. raphani. 

The peptide synthetase product of SEQ ID NO:41 has a deduced 
amino acid sequence as follows (SEQ ID NO:42): 

KKKGPTELTE ILLDKEALKL NEWVLAIGE EVSKRVNEPG TMRVGAFGYP I PDATLAWD 60 
40 PETNLLCSPY SIGEIWVDSP SLSGGFWQLQ KHTETIFHAR PYRFVEGSPT PQLLELEFLR 120 
TGLLGCWEG KIFVLGLYED RIRQRVEWVE HGQLEAEHRY FFVQHLVTSI MKAVPKIYDC 180 
SSFDSYVNGE YLPIILIETQ AASTAPTNPG GPPQQLDIPF LDSLSERCME VLYQEHHLRV 240 
YCVMITAPNT LPRVI KNGRR EIGNMLCRRE FDNGSLPCVH VKFGVERSVQ NIALGDDPAG 300 
GMWSYEASMA RQQFLMLQDK QYSGVDHREV VIDDRTSTPL NQFSNIHDLM QWRVQRQAEE 360 
50 LAYCTVDGRG KEGKGVNWKK FDQKVAGVAM YLKNKVKGQT GDHLLLMYTH SEDFVYAVHA 420 
CFVLGAVCIP MAPIDQNRLN EDAPALLHII ADFKVKAILV NAGVDHLMKV KQVSQHIKQS 480 



45 
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AVILKINVPN TYNTTKPPKQ SSGCRDLKLT IRPAWIQSGF PVLVWTYWTP DQRRIAVQLG 
HSQIMALCKV QKETCQMTST RPVLGCVRST IGLGFIHTCV MGIFLAAPTY LVSPVDFAQN 
PNILFQTMSR YKIKDAYATS QMLDHAIARG AGKNMALHEL KNLMIATDGR PRVDVYQRVR 
VHFSPASLDR TAINTVYSHV LNPMVASRSY MCIEPIELHL DVGALRRGLI MPVDPDTEPG 
ALLVQDSGMV PVSTQISIVN PETNQLCLVG EYGEIWVQ 



540 
600 
660 
720 
760 



360 
420 
480 



In another embodiment of the invention, there is provided an 
isolated nucleic acid molecule encoding a CPS 1 peptide synthetase homolog and 
which hybridizes to a nucleic acid molecule having a sequence corresponding to 
SEQ ID No. 43 as follows : 

15 CTCGAGGTTA GTAAAAGATC CCCGTTTGTT CCACAAATCT CCATCTCCCT CTCAATGCCT 60 

TTCTTGGCGC CTCAACCCGC TATTTTGAAG ACAGTTTGTT GTTGTCGCAT GCGACCAAAA 120 

ATCATCCTCT CAAGTTTTCA TCGCTGACCT GTTTCTTGGC GTAGGAAGGA GATATCACAC 180 

20 AGAAAGGGTA AGCTGCTTTG CGTCCAGAGT ACTTACAATT GCTTCTCAAT TACTTACGCG 240 

CCGGCAGCTA CCAAAAGCGA CGAACTCAAC TTTTCTCCCA ATTCCTCGGT GCACCTCCAC 300 
25 CTCAGATTGC TGCTCTCGCC GAGCCTCAGT CTGGCCTACG CATACACTCG CCCGATGACT 
CCGACCACCC TTCAGGCGAT GGCCATCGCG CTACCGCCTA TGCCGCTCTC GGTAGCAGCA 
GCGGTCCAAT CCCAGATTCA CCAGACTCAC CTATGTACCG ACCGCACTCT GGTTATGCTC 

30 CTTCAGAATC ACCAAGACCT TCTCCAGCAC AACCTCCACC TTCCCTGCTG CGCCCGGGGG 540 

GTTCTCTCGC TGGAGGATCG ACCACTGCTC ACCGCGACTC CCTCTTCTTC TCCCCCTCCC 600 

35 ATCTCGAACC TGAAACCCGG ACAGGTACTA TGATGTCGGG CGACTATGCA TTCAGACCCG 660 

AGCAGCAAGG CACATATGGC GAATCCCAGC ATCAACAGCA CCAGTTCCAG CAACAGCAAC 720 

AGCCACAGCA GCAACAGCAG TACGATGGGC AGCAGTATGA TGGACGAACT ACAACGCTTC 780 

40 TCGATTCGCA AGGATACTTT TCGGATTTTG CGGGACAGCA GCACTATGAT CAGACTCAAA 840 

CCGTTGAGTA TGTGGGACCT CAGCAGCGGT ATTCTTCCAG CGATGCATTC TCTCCAACCG 900 

45 CCGCAATGGC ACCTCCAATG CTTACAACCA ACGACCTCCC ACCGCCGGAA GCGCTTGAGT 960 

ACCAGCTGCC CCTTGACCCT CGCGAGGTAC CATTCGCTAT TCAAGATCCC CATGATGATT 1020 

CTACGCCAAT GTCAAAGTTC GATAACATCG CAGCTGTACT CAGACATAGA GGCCGAACGA 1080 

50 TTG CTAAGAA GCCGGCATAC TGGGTGTTGG ATAGTAAGGG CAAGGAGATT GCATCGATTA 1140 

CGTGGGATAA GCTGGCATCT AGAGCCGAAA AGGTTGCGCA AGTCATCCGC GACAAAAGCT 1200 

55 CTCTGTACCG GGGTGATCGG GTTGCTCTCA TCTACCGCGA TTCAGAGGTT ATTGATTTCG 1260 

CCATTGCCTT GCTGGGATGC TTCATTGCTG GAGTTGTTGC CGTTCCCATC AATGATCTGC 1320 

AGGACTACCA ACGCTTGAAC CACATTCTTA CTACAACGCA GGCCCATCTA GCGCTGACCA 1380 

60 
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CCGATAACAA CCTCAAAGCC TTTCAACGAG 
AGGGTGTCGA ATGGTGGAAG ACAAACGAGT 
5 ATGTCCCGGC TTTGGTTGTT CCCGATCTGG 
GAGACTTGAG AGGTGTTGTT CTGAGCCACC 
GTGCGATTAT TTCTACTATC CCGGGTAATG 

10 

GCGACAAGAA TGGTCGACTT ATTGGTGGCG 
TCGATCCCCG TCAGGGCATT GGCATGATTC 
15 ACACCACTGT TTGGTTCGAC AACAAAGCTG 
TTACCAAGTA CAAATCGACC ATCATGATTG 
ACAACTACCA GCAAGAGCCA ATGGTGACCC 

20 

TTCAAATGAT CAAGCTTTGC TTGATTGACA 
TTTTGGCTGA CCGATGGCTA CGACCGTTGA 
25 CTATGCTTTG TCTACCTGAA CACGGAGGCA 
GAGAAGAGCG CATGGGATGC CCATTAAAGC 
AAGAGAAAGA GGAAACAGAG AAGCCAGCAG 

30 

GTGGTGGCAC AGCAACAACC GAAGAGAGGG 
ATCGTGAGGC TCTAAAGACC AACGAAGTTG 
35 AAAGGGTGAC GGATGACCCA GGCTTGGTAC 
ATGCCACACT CTCCGTCGTC GATCCAGAAA 
GTGAAATCTG GGTCGACTCC CCTTCTCTTT 

40 

CTGAGCTGAT TTTCCATGCT CGTCCTTACA 
CCGTCGAGCC CGAATTCCTG CGAACAGGCT 
45 TTGTTCTGGG CCTTTACGAA GACCGAATTC 
ACGAACTAGC AGAGTACCGC TACTTCTTTG 
ACGTTCCAAA GATATACGAT TGTTCAGCCT 

50 

CAGTCGTGGT GCTGGAGTCA GCAGCTGCGT 
CTCGACAACC GGATACAGCT CTGCTAGAGT 
55 TGTCAGAGCA TCATCTGAGA CTGTACTGCG 
GAGTTGTTAA GAACGGACGA CGCGAAATTG 
TCGGCAACCT TCCATGTGTG CACGTCAAGT 

60 

CTATTGGTGT AGACCCTATA GGTGGTATCT 
AATTCTTATT GCCAGCTGAC AAGCAATACT 
65 ATGACCGTAC TTCAACGCCC CTAAACAATT 

SUBSTITUTE 
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ACATTACTAC ACAAAAGTTG ACATGGCCAA 144 0 

TTGGCAGTTA TCACCCCAAG AAGAAGGAGG 1500 

CATATATCGA GTTTTCGCGG GCCCCAACTG 1560 

GAACCATTAT GCACCAAATG GCTTGTCTTA 1620 

GACCTGGCGA CACTTTCAAC CCGTCTCTTC 1680 

GCGCAAGCAG CGAAATTTTG GTGTCGTACC 1740 

TGAGCGTGCT ACTGACCGTC TACGGCGGCC 1800 

TTGATGTTCC TGGACTGTAC GCCCACCTCC 1860 

CCGACTACCC AGGATTGAAG CGAGCCGCCT 1920 

GAAATTTTAA GAAGGGAATG GAGCCAAACT 1980 

CCTTGACTGT AGACAGCGGG TCCCACGAAG 204 0 

GAAACCCTCG TGCCCGTGAG GTTGTCGCAC 2100 

TGGTGATTAG TGTGCGTGAC TGGCTAGGAG 2160 

TTGAACTTGG GGAGGATACA GAGTCTGACG 2220 

TTTCCAATGG CTTTGGTAGT CTCTTGTCAG 2280 

CAAAGAATGA GCTTGGCGAA GTCCTTTTGG 2340 

TGGTGGTGGC CATAGGTAAC GATGCCCGTA 2400 

GGGTCGGTTC TTTTGGATAC CCCATACCCG 2460 

CGGGTTTACT GGCGTCACCA CATTCCGTGG 2520 

CAGGTGGTTT CTGGGCGCAG CCAAAGAATA 2580 

AGTTTGACCC AGGTGATCCT ACACCGCAGC 2640 

TGCTGGGCAC CGTCATCGAG GGTAAAATCT 2700 

GACAAAAGGT TGAGTGGGTT GAGCATGGAC 2760 

TTCAGCACAT CGTTGTGAGC ATTGTCAAGA 2820 

TTGACGTCTT TGTCAATGAC GAACACCTGC 2880 

CAACGGCACC ATTGACATCT GGAGGACCTC 2940 

CATTGGCTGA GCGCTGCATG GAGGTTCTCA 3000 

TTATGATCAC AGCACCCGAC ACTTTGCCTC 3 060 

GTAACATGCT TTGCCGTCGG GAGTTTGATC 3120 

TTGGCGTGGA GCATGCAGTA CTTAACCTCC 3180 

GGTCACCGTT GGCGTCCGAT TCTCGTGCCG 3240 

CTGGTGTCGA CAGGCGCGAA GTCGTTATCG 33 00 

TCTCTTGCAT TTCGGATCTT ATCCAATGGC 3360 
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GCGTGGCCCG 


TCAACCAGAA 


GAGCTAGCGT 


ACTGCACAAT 


CGATGGCAAA 


AGCCGAGAAG 


3420 




GTAAGGGTGT 


AACATGGAAG 


AAATTCGACA 


CCAAGGTCGC 


TTCCGTTGCC 


ATGTACCTGA 


3480 


5 


AGAACAAGGT 


CAAGGTGAGG 


CCGGGAGACC ACATCATCCT 


CATGTACACA 


CATTCAGAGG 


3540 




AGTTTGTCTT 


TGCCATCCAT 


GCCTGCATTT 


CCTTGGGCGC 


AATTGTCATT 


CCCATCGCAC 


3600 


10 

1 \J 


CCCTCGACCA GAACCGATTG AACGAAGATG 


TCCCAGCTTT 


CCTGCATATT 


GTATCTGATT 


3660 


ACAACGTCAA GGCTGTGCTG 


GTCAACGCTG 


AGGTCGATCA 


TCTAATCAAG 


GTAAAGCCTG 


3720 




TGGCTAGCCA 


TATCAAACAG 


TCAGCCCAGG 


TTCTCAAGAT 


CACGAGCCCT 


GCCATCTACA 


3780 


15 


ACACAACTAA 


GCCGCCAAAG 


CAAAGTAGTG 


GATTGAGGGA 


TTTGAGATTC 


ACCATTGACC 


3840 




CTGCCTGGAT 


TCGGCCTGGC 


TACCCCGTCA 


TTGTTTGGAC 


TTATTGGACC 


CCCGATCAAC 


3900 


Z.\J 


GACGAATTTC 


AGTTCAGCTT 


GGACATGACA 


CCATTATGGG 


CATGTGCAAG 


GTTCAAAAGG 


3960 


AAACTTGCCA 


AATGACAAGT 


TCAAGACCTG 


TGCTTGGATG 


TGTACGAAGC 


ACGACTGGCC 


4020 




TAGGCTTTAT 


TCATACGGCT 


CTGATGGGAA 


TTTATATCGG 


AACACCAACC 


TACCTCCTAT 


4080 


25 


CACCTGTCGA 


GTTTGCAGCC 


AACCCCATGT 


CTCTATTCGT 


CACCTTGTCG 


AGATACAAGA 


4140 




TTAAGGATAC 


TTATGCGACA 


CCACAGATGC 


TTGATCATGC 


CATGAACTCC 


ATGCAGGCCA 


4200 




AGGGCTTTAC ACTTCATGAA CTTAAGAACA TGATGATCAC 


TGCCGAGAGC 


CG AC CAAG AG 


4260 


TTGATGTTTT 


CCAAAAGGTC 


AGACTTCACT 


TTGCTGGGGC 


TGGGCTCGAT 


AGAACTGCTA 


4320 




TTAACACGGT 


CTATTCGCAT 


GTCCTCAACC 


CCATGGTAGC 


GTCGCGATCT 


TATATGTGCA 


4380 


35 


TCGAGCCTAT 


TGAGCTTTGG 


TTGGACACGC 


AAGCGCTTCG 


ACGTGGTCTG 


GTTATTCCTG 


4440 




TGGACCCTGA ATCAGATCCT 


CTGGCCCTAC 


TGGTACAGGA 


CAGCGGTATG 


GTTCCAGTTT 


4500 


40 

TV 


CAACCCAAAT 
GTGAAATTTG 


AGCCATCATC 
GGTCGACTCT 


AACCCTGAAA 
GAAGCCTGCG 


GCAGAATACA 
TCAAGTCATT 


CTGCCTCGAT 
CTATGGCTCC 


GGTGAGTATG 
AAAGACGCTT 


4560 
4620 




TTGACGCTGA 


GCGCTTTGAT 


GGCCGAGCTC 


TTGACGGCGA 


TCCCAACATT 


CAGTATATCC 


4680 


45 


GTACCGGAGA 


CTTGGGTTTC 


CTTCATAATG 


TTAGTCGACC 


TATTGGCCCT 


AATGGTGCCC 


4740 




AGGTGGACAT 


GCAAGTGTTG 


TTTGTTCTCG 


GCAACATTGG 


CGAGACTTTT 


GAGATCAACG 


4800 


50 


GATTGAGCCA 


TTTCCCAATG 


GATATTGAGA ACTCGGTGGA 


AAAATGCCAC 


AGAAACATTG 


4860 


TGGCGAATGG 


CTGGTAAGTA 


TAAAATCTCT 


ATTTGAAGCG 


AATATGCTAA 


CAAAGTCAGT 


4920 




GCGGTGTTCC 


AAGCTGGTGG 


CTTGGTGGTT 


GTTCTGGTTG 


AAGTCAACCG 


CAAGCCATAC 


4980 


55 


CTGGCATCGA 


TTGTTCCCGT 


CATTGTCAAC 


GCTATCCTCA 


ATGAACACCA 


AATCATTGTA 


5040 




GATATCGTCG 


CATTCGTCAA 


CAAGGGAGAC 


TTCCCACGGT 


CTCGTCTAGG 


AGAGAAGCAG 


5100 


60 


CGTGGCAAGA 


TTCTTGGTGG 


CTGGGTTAGT AGAAAGCTGA 


GGACTCTTGC 


CCAGTTCTCG 


5160 


ATTCGCGATA 


TGGACGCCGA 


ATCCACAGCT 


GGTGATATGA 


TGGATCCTTC 


TAGAGCATCA 


5220 




ATGGTCAGCG 


TACGAAGCGG 


AGGCGGTGCT 


GCTCCCGGAT 


CTTCTAGTTT 


GAGGAATGTC 


5280 


65 


GAACCTGCGC 


CTCAAATCTT 


GGAGGAGGAA 


CATGACCAGA 


TGACTCCTCG 


TCACGAATAC 


5340 
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GAAGCAGCCC CTACCATGAT TTCTGAACTT CCCGACGGCC AAGAGACACC GACAGGGTTT 5400 

CAGCACTCGC AATACGAACA CCCACCACAA TCAGCCGGTT CTCAAGCACC AGCCCAGCTG 5460 

AACCTTTCTC ACCAGCCCGA TCAAGGATTC GATATGGACT TTTCACGATA TAGTTCAGCA 5520 

GAGCCCGATC ACGGCCCTGT CCACAGACGT CCAGTCCCAG GCCAAGCCCA ACAACCCGAG 5580 

CCTATGCAAG GGTACGGTCA AGCGCCGCCC CAGATCCGGC TACCAGGTGT TGATGGACGA 5640 

GAGGAGGGAG GGTTCTGGTC ACAGCAGGAA AAGAACGAGA AGAGTGAAGA AGACTGGACA 5700 

ACTGATGCCA TGATGCATAT GAATCTGGCA GGTGATATGA AACCGCCACG ATGATAATAC 5760 

15 ACAACATAAG AGCGAAGTGA CGAAGCGGAG TCGGAGTTGG GAAGCATTTA GAAACGAATA 5820 

ACAAACAATT GGACTTGTCG GTCTGATGGC CTATTTACTT CATTCATAGA TGAGGATTGG 5880 

ATAGTGAATA TGTGATTGGA TAAAGCCTGG GTTTGTGAGT TTGTGAATGC AGTGGGTGCT 5940 

TGCTATAAGC TGTTTTATTG AGGTCTTTGG AGGAGTGTCT AACAAAGATG CAAAGTTACT 6000 

6003 

AGT 



20 



25 Preferably, the CPS 1 nucleic acid molecule which hybridizes to a 

nucleic acid molecule having a sequence as set forth in SEQ ID No. 43 is from the 
plant pathogen Fusarium graminearium. In another preferred embodiment, the 
CPS1 gene is from a plant pathogen such as Fusarium avenaceum, F. carpineum, 
F. chlamydosporum, F. coccophilum, F. culmorum, F. episphaeria, F. equiseti, F. 

30 jlocciferum, F. moniliforme, F. oxysporum, F. redolens, F. sambucinum, F. solani, 
F. subglutinans, F. trichothecioides, F. udum, or F. ventricosum. 

The peptide synthetase product of SEQ ID NO:43 has a deduced 
amino acid sequence as follows (SEQ ID No. 44): 



35 


MMSGDYAFRP EQQGTYGESQ HQQHQFQQQQ 


QPQQQQQYDG QQYDGRTTTL LDSQGYFSDF 


60 


AGQQHYDQTQ TVEYVGPQQR YSSSDAFSPT AAMAPPMLTT 


NDLPPPEALE 


YQLPLDPREV 


120 




PFAIQDPHDD STPMSKFDNI AAVLRHRGRT 


IAKKPAYWVL 


DSKGKEIASI 


TWDKLASRAE 


180 


40 


KVAQVIRDKS SLYRGDRVAL IYRDSEVIDF 


AIALLGCFIA GWAVPINDL QDYQRLNHIL 


240 




TTTQAHLALT TDNNLKAFQR DITTQKLTWP 


KGVEWWKTNE 


FGSYHPKKKE 


DVPALWPDL 


300 


45 


AYIEFSRAPT GDLRGWLSH RTIMHQMACL 


SAIISTIPGN 


GPGDTFNPSL 


RDKNGRLIGG 


360 


GASSEILVSY LDPRQGIGMI LSVLLTVYGG 


HTTVWFDNKA 


VDVPGLYAHL 


LTKYKSTIMI 


420 




ADYPGLKRAA YNYQQEPMVT RNFKKGMEPN FQMIKLCLID 


TLTVDSGSHE 


VLADRWLRPL 


480 


50 


RNPRAREWA PMLCLPEHGG MVISVRDWLG 


GEERMGCPLK 


LELGEDTESD 


EEKEETEKPA 


540 




VSNGFGSLLS GGGTATTEER AKNELGEVLL 


DREALKTNEV 


WVAIGNDAR 


KRVTDDPGLV 


600 




RVGSFGYPIP DATLSWDPE TGLLASPHSV 


GEIWVDSPSL 


SGGFWAQPKN 


TELIFHARPY 


660 
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KFDPGDPTPQ PVEPEFLRTG LLGTVIEGKI FVLGLYEDRI RQKVEWVEHG HELAEYRYFF 720 

VQHIWSIVK NVPKIYDCSA FDVFVNDEHL PVWLESAAA STAPLTSGGP PRQPDTALLE 780 

5 

SLAERCMEVL MSEHHLRLYC VMITAPDTLP RWKNGRREI GNMLCRREFD LGNLPCVHVK 840 

FGVEHAVLNL PIGVDPIGGI WSPLASDSRA EFLLPADKQY SGVDRREWI DDRTSTPLNN 900 

10 FSCISDLIQW RVARQPEELA YCTIDGKSRE GKGVTWKKFD TKVASVAMYL KNKVKVRPGD 960 

HIILMYTHSE EFVFAIHACI SLGAIVIPIA PLDQNRLNED VPAFLHIVSD YNVKAVLVNA 1020 

EVDHLIKVKP VASHIKQSAQ VLKITSPAIY NTTKPPKQSS GLRDLRFTID PAWIRPGYPV 1080 

IVWTYWTPDQ RRISVQLGHD TIMGMCKVQK ETCQMTSSRP VLGCVRSTTG LGFIHTALMG 1140 

IYIGTPTYLL SPVEFAANPM SLFVTLSRYK IKDTYATPQM LDHAMNSMQA KGFTLHELKN 1200 

20 MMITAESRPR VDVFQKVRLH FAGAGLDRTA INTVYSHVLN PMVASRSYMC IEPIELWLDT 1260 

QALRRGLVIP VDPESDPLAL LVQDSGMVPV STQIAIINPE SRIHCLDGEY GEIWVDSEAC 1320 

VKSFYGSKDA FDAERFDGRA LDGDPNIQYI RTGDLGFLHN VSRPIGPNGA QVDMQVLFVL 1380 

GNIGETFEIN GLSHFPMDIE NSVEKCHRNI VANGCAVFQA GGLVWLVEV NRKPYLASIV 144 0 

PVIVNAILNE HQIIVDIVAF VNKGDFPRSR LGEKQRGKIL GGWVSRKLRT LAQFSIRDMD 1500 

30 AESTAGDMMD PSRASMVSVR SGGGAAPGSS SLRNVEPAPQ ILEEEHDQMT PRHEYEAAPT 1560 

MISELPDGQE TPTGFQHSQY EHPPQSAGSQ APAQLNLSHQ PDQGFDMDFS RYSSAEPDHG 1620 

PVHRRPVPGQ AQQPEPMQGY GQAPPQIRLP GVDGREEGGF WSQQEKNEKS EEDWTTDAMM 1680 

HMNLAGDMKP PR 1692 



25 



35 



In another embodiment of the invention, there is provided an 
isolated nucleic acid molecule encoding a CPS1 peptide synthetase homolog and 
40 which hybridizes to a nucleic acid molecule having a sequence corresponding to 
SEQIDNo. 45 as follows: 



45 



AAAAAGAAGG GGCCTACGGA GTTGACCGAG ATATTGCTAG ATAAGGAAGC GCTCAAGATG 60 

AACGATGTTG TGGTCCTTGC AATAGGAGAA GAGGCCAGTA AACGTGCGAA TGAGCCTGGC 120 

ACAATGCGAG TTGGCGCTTT TGGATACCCA ATACCAGATG CGACGCTAGC CGTCGTAGAT 180 

CCAGAGACGA ATCTCTTGTG TTCACCCTAC TCGATAGGAG AGATTTGGGT AGACTCACCT 240 

50 TCATTGTCTG GTGGTTTCTG GCAATTGCAG AAGCACACTG AAACTATATT TCACGCCCGC 300 

CCATACCGCT TTGTGGAGGG CAGTCCTACC CCGCAGTTGC TTGAGCTTGA GTTTCTCCGG 360 

ACAGGCTTAC TCGGATTCGT CGTAGAGGGC AAGGTCTTTA TCCTTGGTCT CTATGAAGAT 420 

CGCATCAGGC AGCGCGTTGA ATGGGTAGAA CATGGTCAGC TGGAAGCTGA ACACAGATAC 4 80 

TTCTTCGTGC AGCACCTCGT CACCAGTATC ATGAAGGCTG TTCCCAAGAT CTACGACTGG 540 

60 TAAGTCTTCT CATGTTTTAG ATGAGCGTTC TAACACTATG CAGCTCATCT TTCGACTCGT 600 



55 
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ACGTCAATGG 


CGAATACCTG 


CCTATCATCC 


TCATCGAGAC 


ACAGGCTGCA 


TCGACAGCCC 


660 


CTACGAACCC 


TGGTGGAC CG 


CCACAGCAAC 


TCGACATCCC 


CTTCCTAGAC 


TCACTGTCTG 


720 


AGCGATGCAT 


GGAAGTGTTG 


TATCAAGAAC 


ACCATCTGCG 


AGTATACTG C 


GTCATGATCA 


780 


CAGCGCCAAA 


CACATTACCA 


CGAGTTGTTA AGAATGGTCG ACGAGAAATT GGCAACATGC 


840 


TCTGTCGAAG 


AGAATTTGAT 


AATGGCTCAT 


TACCTTGTGT 


CCACGTCAAG 


TTTGGTGTTG 


900 


AGAGGTCAGT 


TCTCAACATC 


GCGTTGGGTG 


ATGACCCCTC 


CGGAGGCATG 


TGGTCATATG 


960 


AAGCCTCGAT 


GGCGCGTCAG 


CAGTTCTTGA 


TGCTCCAAGA 


CAAGCAGTAT 


TCTGGAGTAG 


1020 


ATCACCGCGA 


AGTCGTCATG 


GATGACAGAA 


CATCGACACC 


TCTCAACCAA 


TTCTCCAACA 


1080 


TTCACGACCT 


CATGCAATGG 


CGCGTATCAC 


GGCAGGCTGA 


AGAGCTCGCA 


TATTG GACAG 


1140 


TCGACGGTCG 


AGGCAAAGAA 


GGCAAGGGCG 


TCAACTGGAA 


GAAGTTCGAC 


CAGAAAGTTG 


1200 


CGGGTGTCGC 


AATGTACCTG 


AAGAACAAGG 


TCAAAGTGCA AACCGGCGAT 


CATCTGCTTC 


1260 


TGATGTATAC 


GCACTCGGAA 


GACTTTGTAT 


ATGCGGTACA 


TGCATGCTTT 


GTGCTTGGCG 


1320 


CTGTATGCAT 


ACCAATGGCA 


CCAATCGACC 


AGAACCGATT 


GAATGAGGAT 


GCACCTGCAT 


1380 


TGCTGCACAT 


CCTTGCAGAC 


TTCAAGGTCA AGGCCATCCT 


CGTCAATGCC 


GATGTGGATC 


1440 


ATCTCATGAA 


GGTCAAGCAA 


GTATCGCAGC 


ACATCAAACA 


ATCAGCAGCC 


ATCTTCAAGA 


1500 


TCAACGTGCC 


GCACACTTAC 


AACACAACCA 


AGCCACCTAA 


GCAGTCGAGT 


GGTTGTCGGG 


1560 


ATCTCAAGCT 


CACAATACGG 


CCTGCCTGGG 


TACAGCCTGG 


TTTCCCAGTT 


CTTGTATGGA 


1620 


CATACTGGAC 


TCCAGATCAA 


CGCCGTATAG 


CCGTACAACT 


AGGTCATAGC 


CAAATCATGG 


1680 


CACTAGGCAA 


GGTCCAGAAG 


GAGACTTGTC 


AAATGACAAG 


TACAAGGCCA 




1740 


GTGTACGGAG 


TACCATCGGA 


CTTGGCTTCA 


TTCATACCTG 


CATCATGGGC 


ATCTTCCTTG 


1800 


CCGCACCCAC 


TTACCTCGTG 


TCGCCTGTCG 


ACTTTGCACA 


AAATCCAAAC 


ATACTCTTCC 


1860 


AGACGTTATC 


AAG A 1 Av-AACj 


ATCAAGAATG 


CGTACGCAAC 


CAGTCAAATG 


TTGGATCACG 


1920 


CTATTGCCCG 


TGGGGCTGGA 


AAGAACATGG 


CCCTGCACGA 


ACTCAAGAAT 


CTCATGATTG 


1980 


CGACTGATGG 


TAGGCCGCGT 


GTTGATGTTT 


ACCAGAGAGT 


GCGCGTACAC 


TTTTCACCAG 


2040 


CAAGCTTGGA 


CCGGACAGCG 


ATTAACACAG 


TCTACTCTCA 


CGTGCTCAAC 


CCAATGGTAG 


2100 


CATCGCGATC 


ATACATGTGC 


ATCGAGCCAA 


TAGAACTGCA 


TCTCGACGTC 


AACGCTCTTC 


2160 


GAAGAGGTCT 


GATCATGCCC 


GTCGACCCAG 


ATACCGAGCC 


TGGCGCTCTA 


ATGGTCCAGG 


2220 


ACTCTGGTAT 


GGTGCCAGTC 


TCCACACAAA TAGCAATTGT 


GAACCCAGAG 


ACAAACCAGC 


2280 


TTTGCTTGGT 


TGGCGAATAT 


GGCGAAATCT 


GGG TTCAATC 






2320 



60 Preferably, the CPS 1 nucleic acid molecule which hybridizes to a 

nucleic acid molecule having a sequence as set forth in SEQ ID No. 45 is from the 
plant pathogen Pyrenophora teres. In another preferred embodiment, the CPS1 
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gene is from a plant pathogen such as Pyrenophora avenae, P. bromi, P. 
leuceienes, P, phaeocomes, P. schroeteri, P. trichostoma, or P. tritici-repentis. 



The peptide synthetase product of SEQ ID No:45 has a deduced 





amino acid sequence as follows (SEQ ID No. 46): 






5 


KKKGPTELTE 


ILLDKEALKM NDVWLAIGE EASKRANEPG TMRVGAFGYP 


I PDATLAWD 


60 




PETNLLCSPY 


SIGEIWVDSP SLSGGFWQLQ KHTETIFHAR PYRFVEGSPT 


PQLLELEFLR 


120 


1 0 


TGLLGFWEG 


KVFILGLYED RIRQRVEWVE HGQLEAEHRY FFVQHLVTS I 


MKAVPKIYDC 


180 


SSFDSYVNGE 


YLPIILIETQ AASTAPTNPG GPPQQLDIPF LDSLSERCME 


VLYQEHHLRV 


240 




YCVMITAPNT 


LPRWKNGRR EIGNMLCRRE FDNGSLPCVH VKFGVERSVL 


NIALGDDPSG 


300 


15 


GMWSYEASMA 


RQQFLMLQDK QYSGVDHREV VMDDRTSTPL NQFSNIHDLM QWRVSRQAEE 


360 




LAYCTVDGRG 


KEGKGVNWKK FDQKVAGVAM YLKNKVKVQT GDHLLLMYTH 


SEDFVYAVHA 


420 


20 


CFVLGAVCIP 
AAIFKINVPH 


MAPIDQNRLN EDAPALLHIL ADFKVKAILV NADVDHLMKV 
TYNTTKPPKQ SSGCRDLKLT IRPAWVQPGF PVLVWTYWTP 


KQVSQHIKQS 
DQRRIAVQLG 


480 
540 




HSQIMALGKV QKETCQMTST RPVLGCVRST IGLGFIHTCI MGIFLAAPTY 


LVSPVDFAQN 


600 


25 


PNILFQTLSR 


YKIKNAYATS QMLDHAIARG AGKNMALHEL KNLMIATDGR 


PRVDVYQRVR 


660 




VHFSPASLDR 


TAINTVYSHV LNPMVASRSY MCIEPIELHL DVNALRRGLI 


MPVDPDTEPG 


720 




ALMVQDSGMV 


PVSTQIAIVN PETNQLCLVG EYGEIWVQ 




758 



30 

As used herein, the term "nucleic acid" refers to deoxyribonucleic 
acid (DNA) or ribonucleic acid and polymers thereof in either a single or double 
stranded form. As used herein, "nucleic acid" also encompasses nucleic acids 
containing known analogs of naturally occurring nucleotides. Unless otherwise 

35 indicated, a particular nucleic acid sequence also encompasses conservatively 
modified variants thereof such as degenerate codon substitutions and 
complementary sequences. As used herein, the term "nucleotide sequence" refers 
to a polymer of DNA or RNA which may be single or double stranded and may 
contain synthetic, non-natural, or altered nucleotide bases capable of incorporation 

40 into DNA or RNA polymers. The terms "nucleic acid" , "nucleic acid molecule", 
"nucleic acid fragment", or nucleic acid sequence or segment", may also be used 
interchangeably with the terms "gene", "cDNA", "DNA" and "RNA". 

Other DNA molecules of the present invention include DNA 
molecules that have a nucleic acid sequence which is more than 70% identical to 
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the nucleotide sequence of SEQ. ID. Nos. 2, 41, 43, or 45. Nucleotide sequence 
similarity may be determined by the BLAST program with the default parameters 
(Altschul et al., "Basic Local Alignment Search Tool," J. Mol. Bio L, 215:403-410 
(1990), which is hereby incorporated by reference). 

5 Preferred sequences include those DNA molecules which will 

hybridize to a nucleic acid molecule having the sequence of SEQ. ID No. 2, 41 , 
43, 45 or their compliments. Generally, stringent conditions are selected to be 
about 50°C lower than the thermal melting point (T m ) for the specific sequence at 
a defined ionic strength and pH. The T m is the temperature (under defined ionic 

10 strength and pH) at which 50% of the target sequence hybridizes to a perfectly 
matched probe. The T m is dependent upon the solution conditions and the base 
composition of the probe, and may be calculated using the following equation: 

T m = 79.8°C + (18.5xLog[Na+]) 
+ (58.4°C x %[G+C]) 
15 - (820 / #bp in duplex) 

(0.5 x % formamide) 
More preferred stringent conditions are when the temperature is 
20°C below T m , and the most preferred stringent conditions are when the 
temperature is 10°C below T m . Nonspecific binding may also be controlled using 
20 any one of a number of known techniques such as, for example, blocking the 
membrane with protein-containing solutions, addition of heterologous RNA, 
DNA, and SDS to the hybridization buffer, and treatment with RNase. 

Wash conditions are typically performed at or below stringency. 
Generally, suitable stringent conditions for nucleic acid hybridization assays or 
25 gene amplification detection procedures are asas set forth above. More or less 
stringent conditions may also be selected. 

For the purposes of defining the level of stringency, reference can 
conveniently be made to Sambrook, J., E.F. Fritsch, et al. 1989 "Molecular 
Cloning: a Laboratory Manual, 2 nd Edition, Cold Spring Harbor, NY, Cold Spring 
30 Harbor Laboratory Press, at 1 1 .45. An example of low stringency conditions is 
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4-6X SSC/0. 1 -0.5% w/v SDS at 37°-45° C for 2-3 hours. Depending on the 
source and concentration of the nucleic acid involved in the hybridization, 
alternative conditions of stringency may be employed such as medium stringent 
conditions. Examples of medium stringent conditions include 1-4X SSC/0.25% 

5 w/v SDS at > 45° C for 2-3 hours. An example of high stringency conditions 
includes 0.1-1X SSC/0.1% w/v SDS at 60 C for 1-3 hours. The skilled artisan is 
aware of various parameters which may be altered during hybridization and 
washing and which will either maintain or change the stringency conditions. For 
example, another stringent hibridization condition is hybridization at 4X SSC at 

10 65° C, followed by a washing in 0. 1 X SSC at 65° C for about one hour. 
Alternatively, an exemplary stringent hybridization condition is in 50% 
formamide, 4XSSC, at 42° C. Still another example of stringent conditions 
include hybridization at 62° C in 6X SSC, .05X BLOTTO, and washing at 2X 
SSC, 0.1% SDS at 62° C. 

1 5 Other proteins or polypeptides of the present invention include 

polypeptides that have an amino acid sequence having at least 75 % similarity to 
the amino acid sequence of at least one of SEQ. ID. No. 3, SEQ ID No. 42, SEQ 
ID No. 44, or SEQ ID No. 46. In a preferred embodiement of the invention, the 
protein or polypeptide will have at least 90% similarity with at least one of SEQ. 

20 ID No. 3, SEQ ID No. 42, SEQ ID No. 44, or SEQ ID No. 46. Protein sequence 
similarity may be determined by the BLAST program with the default parameters 
(Altschul et al., "Basic Local Alignment Search Tool," J. Mol. Biol., 215:403-410 
(1990), which is hereby incorporated by reference). The CPS1 protein of SEQ. 
ID. No. 3 has a molecular weight of about 190-200 kDa, preferably 193.2 kDa. 

25 The CPS 1 protein contains two structurally similar modules, both of which are 
similar to SafBl, the first module of saframycin synthetase B (overall 25% 
identity; 50% similarity) and have apparent amino-acid-activating and thiolation 
domains with core sequences conserved in known peptide synthetases. 

The DNA molecule encoding a CPS1 protein or polypeptide of the 
30 present invention can be incorporated in cells using conventional recombinant 
DNA technology. Generally, this involves inserting the DNA molecule into an 
expression system to which the DNA molecule is heterologous (i.e., not normally 
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present). The heterologous DNA molecule is inserted into the expression system 
or vector in proper sense orientation and correct reading frame. The vector 
contains the necessary elements for the transcription and translation of the inserted 
protein-codin^sequences. U.S. Patent No. 4,237,224 to Cohen and Boyer, which 

5 is hereby incorporated by reference, describes the production of expression 
systems in the form of recombinant plasmids using restriction enzyme cleavage 
and ligation with DNA ligase. These recombinant plasmids are then introduced 
by means of transformation and replicated in unicellular cultures including 
procaryotic organisms and eukaryotic cells grown in culture. Recombinant genes 

1 0 may also be introduced into viruses, such as vaccinia virus. Recombinant viruses 
can be generated by transfection of plasmids into cells infected with virus. 

Suitable vectors include, but are not limited to, the following viral 
vectors such as lambda vector system gtl 1 , gt WES.tB, Charon 4, and plasmid 
vectors such as pBR322, pBR325, pACYC177, pACYC184, pUC8, P UC9, 

15 P UC18, pUC19, pLG339, pR290, pKC37, pKClOl, SV 40, pBluescript II SK +/- 
or KS +/- (see "Stratagene Cloning Systems" Catalog (1993) from Stratagene, La 
Jolla, Calif, which is hereby incorporated by reference), pQE, pIH821, pGEX, 
pET series (see Studier et. al., "Use of T7 RNA Polymerase to Direct Expression 
of Cloned Genes," Gene Expression Technology , vol. 185 (1990), which is hereby 

20 incorporated by reference), and any derivatives thereof. Suitable vectors are 
continually being developed and identified. Recombinant molecules can be 
introduced into cells via transformation, transduction, conjugation, mobilization, 
or electroporation using methods well known in the art. The DNA sequences are 
cloned into the vector using standard cloning procedures in the art, as described by 

25 Maniatis et al., Molecular Cloning: A Laboratory Manual Cold Springs 

Laboratory, Cold Springs Harbor, New York (1982), which is hereby incorporated 
by reference. 

A variety of host- vector systems may be utilized to express the 
protein-encoding sequence(s). Primarily, the vector system must be compatible 
30 with the host cell used. Host- vector systems include but are not limited to the 
following: bacteria transformed with bacteriophage DNA, plasmid DNA, or 
cosmid DNA; microorganisms such as yeast containing yeast vectors; mammalian 
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cell systems infected with virus (e.g., vaccinia virus, adenovirus, etc.); insect cell 
systems infected with virus (e.g., baculovirus); and plant cells infected by bacteria 
or transformed via particle bombardment (i.e. biolistics). The expression elements 
of these vectors vary in their strength and specificities. Depending upon the host- 
5 vector system utilized, any one of a number of suitable transcription and 
translation elements can be used. 

Different genetic signals and processing events control many levels 
of gene expression (e.g., DNA transcription and messenger RNA ("mRNA") 
translation). Transcription of DNA is dependent upon the presence of a promoter 
10 which is a DNA sequence that directs the binding of RNA polymerase and thereby 
promotes mRNA synthesis. The DNA sequences of eukaryotic promoters differ 
from those of procaryotic promoters. Furthermore, eukaryotic promoters and 
accompanying genetic signals may not be recognized in or may not function in a 
procaryotic system, and, further, procaryotic promoters are not recognized and do 
1 5 not function in eukaryotic cells. Similarly, translation of mRNA in procaryotes 
depends upon the presence of the proper procaryotic signals which differ from 
those of eukaryotes. Efficient translation of mRNA in procaryotes requires a 
ribosome binding site called the Shine-Dalgarno ("SD") sequence on the mRNA. 
This sequence is a short nucleotide sequence of mRNA that is located before the 
20 start codon, usually AUG, which encodes the amino-terminal methionine of the 
protein. The SD sequences are complementary to the 3'-end of the 16S rRNA 
(ribosomal RNA) and probably promote binding of mRNA to ribosomes by 
duplexing with the rRNA to allow correct positioning of the ribosome. For a 
review on maximizing gene expression, see Roberts and Lauer, Methods in 
25 Enzvmologv. 68:473 (1979), which is hereby incorporated by reference. 

Promoters vary in their "strength" (i.e., their ability to promote 
transcription). For the purposes of expressing a cloned gene, it is desirable to use 
strong promoters in order to obtain a high level of transcription and, hence, 
expression of the gene. Depending upon the host cell system utilized, any one of 
30 a number of suitable promoters may be used. For instance, when cloning in E. 
coli, its bacteriophages, or plasmids, promoters such as the T7 phage promoter, 
lac promoter, trp promoter, reck promoter, ribosomal RNA promoter, the P R and 
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P L promoters of coliphage lambda and others, including but not limited, to 
/acUV5, ompY y bla, Ipp, and the like, may be used to direct high levels of 
transcription of adjacent DNA segments. Additionally, a hybrid trp-lacUVS (tad) 
promoter or other E. coli promoters produced by recombinant DNA or other 
5 synthetic DNA techniques may be used to provide for transcription of the inserted 
gene. Bacterial host cell strains and expression vectors may be chosen which 
inhibit the action of the promoter unless specifically induced. In certain operons, 
the addition of specific inducers is necessary for efficient transcription of the 
inserted DNA. For example, the lac operon is induced by the addition of lactose 
10 or IPTG (isopropylthio-beta-D-galactoside). A variety of other operons, such as 
trp,pro, etc., are under different controls. 

Specific initiation signals are also required for efficient gene 
transcription and translation in procaryotic cells. These transcription and 
translation initiation signals may vary in "strength" as measured by the quantity of 

1 5 gene specific messenger RNA and protein synthesized, respectively. The DNA 
expression vector, which contains a promoter, may also contain any combination 
of various "strong" transcription and/or translation initiation signals. For instance, 
efficient translation in E. coli requires a Shine-Dalgarno ("SD") sequence about 7- 
9 bases 5' to the initiation codon ("ATG") to provide a ribosome binding site. 

20 Thus, any SD-ATG combination that can be utilized by host cell ribosomes may 
be employed. Such combinations include but are not limited to the SD-ATG 
combination from the cro gene or the N gene of coliphage lambda, or from the £. 
coli tryptophan E, D, C, B or A genes. Additionally, any SD-ATG combination 
produced by recombinant DNA or other techniques involving incorporation of 

25 synthetic nucleotides may be used. 

The present invention also relates to anti-sense nucleic acid for 
essential cell proteins, such as replication proteins, which serve to render the host 
cells incapable of further cell growth and division. Anti-sense regulation has been 
described by Rosenberg et al, "Production of Phenocopies by Kruppel Antisense 
30 RNA Injection Into Drosophila Embryos" Nature , 313:703-706 (1985); Preiss et 
al., "Molecular Genetics of Kruppel, A Gene Required for Segmentation of the 
Drosophila Embryo," Nature . 313:27-32 (1985); Melton, "Injected Anti-sense 
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Elijah Cocks 
cohn@psfc.mit.edu 
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Enlarged Volume and Decoupled Air Flow Control 



Dan, 



I received your voicemail message about changing the MIT case numbers for Enlarged Volume and 
Decoupled Air Flow Control. These cases were originally input into our system as being Continuation in 
Parts (CIPs) of MIT 8530 Low Power Compact Plasma Fuel Converter (hence the designations CIP2 and 
CIP3) It has become clear that we are no longer claiming priority to this original application with these 
two applications; however, we should relabel internally with whatever actual MIT Case No. you believe 
these cases should actually be tagged under. Do you happen to know which MIT Case No. that is? 

Also, and perhaps more importantly with respect to filing the Enlarged Volume Plasmatron application, the 
two DOE grant numbers currently listed in the application were the ones for the MIT 8530 matter. Do you 
know if there are, in fact, different DOE grant numbers for whatever MIT Case No. the Enlarged Volume 
Plasmatron should actually be under? Do you happen to know what these DOE Grant numbers might be? 



Elijah Cocks 
Patent Attorney 
Choate, Hall & Stewart 
Exchange Place 
53 State Street 
Boston, MA 02109 

(617) 248-4792 - Direct Dial 
(617) 248-5000 -Main 
(617)248-4000- Fax 

E-mail: ecocks@choate.com 



CC: Pasternack, Sam 
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RNAs Specifically Block Messenger RNA Translation In vivo," Proc. Natl. Acad. 
Sci. USA . 82:144-148 (1985); Izant et al., "Constitutive and Conditional 
Suppression of Exogenous and Endogenous Genes by Anti-sense RNA," Science , 
229:345-352 (1985); Kim et al., "Stable Reduction of Thymidine Kinase Activity 

5 in Cells Expressing High Levels of Anti-sense RNA," Cell, 42:129-138 (1985); 
Pestka et al., "Anti-mRNA: Specific Inhibition of Translation of Single mRNA 
Molecules," Pmr Natl Acad. Sci. USA . 81:7525-7528 (1984); Coleman et al., 
"The Use of RNAs Complementary to Specific mRNAs to Regulate the 
Expression of Individual Bacterial Genes," CeU, 37:429-436 (1984); and McGarry 

1 0 et al., "Inhibition of Heat Shock Protein Synthesis by Heat-Inducible Antisense 
RNA," Proc. Natl. Acad. Sci. USA. 83:399-403 (1986), which are hereby 
incorporated by reference. 

Once the isolated DNA molecules encoding the CPS1 protein or 
polypeptide, as described above, has been cloned into an expression system, they 

1 5 are ready to be incorporated into a host cell. Such incorporation can be carried out 
by the various forms of transformation noted above, depending upon the 
vector/host cell system. Suitable host cells include, but are not limited to, 
bacteria, virus, yeast, mammalian cells, insect, plant, and the like. In the present 
invention, the host cells may be from plants such as corn, oat, grass, weed, 

20 bamboo, and sugarcane. 

One aspect of the present invention involves using an inhibitor of 
the CPS1 protein to interfere with the plant infection process in order to impart 
disease resistance to plants. In one mechanism, the peptide synthetase CPS1 
could be inhibited by an appropriate drug, thereby causing the plants to be 
25 resistant to fungal attack. Alternatively, the CPS1 nonribosomal product could be 
degraded by an enzyme for which it is a suitable substrate, and, when the gene 
encoding this enzyme is genetically engineered into plants, the plants will become 
resistant to fungal attack. 

In this aspect of the present invention, large numbers of 
30 compounds can be screened for their activity as inhibitors of CPS 1 protein by a 
high-throughput screening assay as described in U.S. Patent No. 5,876,946 to 
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Burbaum et al., which is hereby incorporated by reference. Generally, a library of 
compounds is assayed for inhibition of an enzyme catalyzed reaction and the 
amounts of fluorescence bound to individual suspendable solid supports measured 
to determine the degree of inhibition. For example, the amount of fluorescence 

5 bound to a microbead in the presence of inhibitory compounds is greater than for 
non-inhibitory compounds. The amounts of fluorescence bound to individual 
beads are determined by confocal microscopy. Using this type of assay, inhibition 
can be determined of a peptide synthetase such as CPS 1 . For CPS 1 , the substrate 
can be amino acids (or hydroxy acids), linked at one end to the microbead and at 

1 0 the other end to a fluorescent label. The enzyme inhibitors can be utilized to 
impart fungal resistance to a variety of plants including oats, grasses, weeds, 
sugarcane, and corn in particular. 

Thus, the present invention provides a method for identifying 
inhibitors of a CPS1 protein , wherein said CPS1 protein is a peptide synthetase 

15 of a plant pathogen. The method comprises: providing a CPS 1 protein or 
polypeptide, contacting the protein or polypeptide with potential inhibitor 
compounds; determining peptide synthetase activity, and selecting compounds 
which decrease the peptide synthetase activity. The method is especially useful in 
identifying inhibitors of a CPS1 protein from plant pathogens of the genera 

20 Cochliobolus, Alternaia, Fusarium, and Pyrenophora such as those described 

hereinabove. Preferably, the method may be used to identify inhibitors of a CPS1 
protein from Alternaria solani, Fusarium graminearium, and Pyrenophora teres. 

Another aspect of the present invention involves using one or more 
of the above DNA molecules encoding a CPS1 protein or polypeptide or a gene 

25 encoding an enzyme that degrades the CPS 1 N.R.P. product to transform plants in 
order to impart fungal resistance to the plants. This concept of pathogen-derived 
resistance, according to U.S. Patent No. 5,840,481 to Johnston and Sanford, which 
is hereby incorporated by reference, is that host resistance to a particular parasite 
can effectively be engineered by introducing a gene, gene fragment, or modified 

30 gene or gene fragment of the pathogen into the host. This approach is based on 
the fact that in any parasite-host interaction, there are certain parasite-encoded 
cellular functions (activities) that are essential to the parasite but not to the host 
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and that when one of the essential functions of the parasite such as survival or 
reproduction is disrupted, the parasitic process will be stopped. "Disruption" 
refers to any change that diminishes the survival, reproduction, or infectivity of 
the parasite. Such essential functions, which are under the control of the 

5 parasite's genes, can be disrupted by the presence of a corresponding gene product 
in the host which is (1) dysfunctional, (2) in excess, or (3) appears in the wrong 
context or at the wrong developmental stage in the parasite's life cycle. If such 
faulty signals are designed specifically for parasitic cell functions, they will have 
little effect on the host. Therefore, the procedure for making plants, for example, 

10 resistant to infection by one or more fungus involves isolating DNA coding for a 
gene such as CPS1 of a fungus, operably linking the DNA within an expression 
vector, transforming the plant cell or plant tissue with the expression vector, and 
growing the transformed plant cells or plant tissue in the presence of the fungus 
such as e.g., Cochliobolus heterostrophus, Alternaria solani, Fusarium 

15 graminearium, or Pyrenophora teres, where the CPS1 DNA is expressed as a 
gene product and the CPS protein disrupts the essential activity of the fungi. 

Thus, the present invention provides a method of imparting disease 
resistance to a plant by over-expressing a CPS1 polypeptide in the plant, wherein 
the polypeptide has protein synthetase activity. In praticing this aspect of the 

20 invention, the plant may be any plant in which it is desired to impart disease 

resistance. Thus, the plant may be an agrigultural crop or ornamental plant. The 
plant may be herbacious or woody. The plant may be a monocot or dicot. 
Examples of plants which may be used in practicing the present invention, include 
but are not limited to, com, oats, grasses, weeds, sugarcane, barley, wheat, rice, 

25 tomato, potato, citrus, malus, rye, cotton, brassica, cabbage, and carrot. Many 
other plants may also be used in the practice of the present invention. As a 
guideline, plants which serve as hosts for Cochliobolus sp. } Fusarium sp. t 
Alternaria sp. t and Pyrenopherasp., may be used. Reference to host plants may 
be conveniently made to Fungi on Plants and Plant Products in the United States, 

30 David, F. Farr et al. editors, American Phytopathological Society Press, St. Paul, 
Minnesota, 1989. 
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In another preferred embodiment, the CPS1 peptide is from the 
genera Cochliobolus, Alternaria, Fusarium ,or Pyrenophora. In an even more 
preferred embodiment, the CPS1 gene is from Cochliobolus heterostrophus, 
Alternaria solani, Fusarium graminearium or Pyrenophora teres, 

5 Promoters and other regulatory regions which function in plants are 

well known and include e.g., constitutive promoters, inducible promoters, 
temporally regulated and tissue specific promoters. Examples of constitutive 
promters include e.g., actin, CAMV 35S, MAS, ubiquitin, rice cyclophilin, maize 
H3 histone, and actin 2. Examples of tissue specific promoters include e.g., leaf 

1 0 specific promoters such as the RuBisCo ssu, Cab (chlorophyll a/b/binding) 

protein, and the AldP gene promoter from rice (Kagaya et al., 1995 Molecular and 
General Genetics 245:668-674. Examples of root specific promoters include e.g., 
beta tubulin (Oppenheimer et al. Gene 63:87, 1988), and SbPRPl (Suzuki et al., 
Plant Mol Biol 27:109-1 19, 1993. Thus, using well known methods and widely 

1 5 available regulatory sequences, the skilled artisan is direct expression of a subject 
CPS nucleic acid molecule in a plant. 

To provide regulated expression of a CPS gene of the present 
invention, plants are transformed with a vector which replicates in a plant cell and 
which have a promoter which directs expression of the CPS gene product in the 

20 plant. Methods of plant transformation are well known in the art. A vector 
comprising a subject nucleic acid molecule coding for a CPS gene or fragment 
thereof may be introduced into a plant by leaf disk transforamtion-regeneration 
procedure as described by Horsh et al. (1985) Science 227:1229-1231. Other 
methods of transformation such as protoplast culture (Horsh et al. 1984 Science, 

25 225:496; DeBlock et al. (1984) Embo J, 2:2143; Barton et al. (1983) Cell, 
32: 1033) may also be used and are within the scope of this invention. 

In transforming dicot plant species, plants may be transformed 
Agrobacterium-derived vectors such as those described in Klett et al. (1987) 
Annu. Rev. Plant Physiol, 55:467. Other well known methods are available to 
30 insert the subject CPS genes into plant cells. Such alternative methods include 
biolistic approaches (Klein et al. 1987, Nature:321\lQ\ electroporation, 
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microinjection (Potrykus and Spangenberg eds., Gene Transfer to Plants, Springer 
Verlag, Berlin , 1995), chemically-induced DNA uptake, the use of viruses or 
pollen as vectors, liposome mediated transformation, transformation using 
wounded or enzyme-degraded immature embryos, or wounded or enzyme 
5 degraged embryonic callus. 

For transformation of monocot plant species, a number of well 
known methods may be used such as biolistic methods (Tang, K. W. et al. 2000, 
"Acta Biotechnologies 20(2): 175-183; de-Villiers, S. M. et al., 2000, South 
African Journal of Plant and Soil. 17(1): 50- 53) protoplast transformation, 
1 0 electroporation of partially permeabilized cells, and introduction of DNA using 
glass fibers, etc. Agrobacterium may also be used (Raineri, D. M., et al., 1990 
Bio/Technology Vol. 8 :33-38). 

As used herein, "transformation" refers to the transfer of an 
exogenous nucleic acid molecule into a host cell. The nucleic acid molecule may 
1 5 be stably or transiently introduced into the host cell and may be maintained non- 
integrated for example, as a plasmid, or alternatively, may be integrated into the 
host genome. The resulting transformed plant cell(s) may then be used to 
regenerate a transformed plant via standard methods. 

The invention is further illustrated by the following examples 
20 which are not intended in any way to limit the scope of the invention. 

EXAMPLES 

EXAMPLE 1 ~ Materials and Methods for Examples 2-7 

Strains, Media, Crosses and Transformation. C4 (Toxl + ; MAT- 
2) and C5 (Toxl-, MAT-1) are member of near-isogenic C. heterostrophus strains 

25 (Leach et al., 1 982, which is hereby incorporated by reference). R.C4.2696 
(Tox + ; MAT-2; hygB R ) is a C4-derived mutant generated using the REMI 
mutagenesis procedure (Lu et al., "Tagged Mutations at the Toxl Locus of 
Cochliobolus heterostrophus Using Restriction Enzyme-Mediated Integration," 
Pmc Natl. Acad. Sci. USA. 91 : 12649-12653 (1994), which is hereby incorporated 

30 by reference). Strains 1301R33 (Tox'; MAT-2; hygB% 1301R45 (Tax-, MAT-1; 
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hygB R \ 1301R26 (Tox+\ MAT- 2; hygB R ) are progeny of the cross C5 X 
R.C4.2696. Culture media, including CM (complete medium), CMX (complete 
medium with xylose instead of glucose), CMNS (CM with salts omitted), and MM 
(minimal medium) have been described, as have mating procedures (Leach et al., 

5 1 982; Turgeon et al., "Transformation of the Fungal Maize Pathogen 

Cochliobolus heterostrophus Using las Aspergillus nidulans amdS Gene," MoL 
Gen. Genet. , 201 :450-453 (1985), which are hereby incorporated by reference). 
All strains were grown at 24°C under the warm white light or black light 
(F40/350BL) (Sylvania Inc., Danvers, MA). Ascospore germination was done at 

1 0 32°C in the dark for 3 days. REMI transformants were purified by transferring the 
transformants from the original REMI plates to fresh CMNS medium containing 
hygromycin B (Calbiochem R ) at 80 ug/ml. For conidiation, stable transformants 
were transferred to CMX containing the same drug but at a higher concentration 
(120 ug/ml) to compensate for reduced drug activity due to the inhibition by the 

1 5 salts in the medium. Single conidia were picked up under a dissecting microscope 
and grown on CMNS hygromycin B plates; stable colonies were then transferred 
to individual CMX/ hygromycin B plates. All purified transformants were stored 
at -70°C in CM liquid medium containing 25% of glycerol in 96-well microtiter 
dishes. 

20 Bioassays. Fungal strains were grown on CMX plates ( 1 00 X 

15mm) for 7-10 days at 24°C under the light for maximum conidiation. To verifiy 
normal T-toxin production by a race T isolate, 1.0 ml of T-toxin-sensitive E. coli 
(DH5a) cells were evenly spreaded on LB medium containing ampicillin (100 
ug/ml) and the plates were allowed to air dry for 30 min in a laminar hood. Agar 

25 plugs bearing fungal mycelia were inoculated (upside down) onto the E. coli cell 
lawn and the plates were incubated at 32°C. Wild type race T and race O were 
used as controls for each assay plate. T-toxin-producing strains of the fungus will 
inhibit growth of the E. coli cells and produce halos. Tox' mutants can be 
distinguished from wild type by failure to produce a halo (tight) or by production 

30 of halos smaller (leaky) or larger than wild type (overproducing). All Tox' 
mutants were transferred to Fries medium (Pringle et al., "The Isolation of the 
Toxin of Helminthosporium victoriae? Phytopathology , 47:369-371 ( 1957), 
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which is hereby incorporated by reference), which optimizes toxin production, and 
retested. T-cytoplasm corn plants (inbred W64A) are used to verify the Tox' 
mutants identified from the E. coli assay using the procedure described below. 
Mutants defective in T-toxin production fail to produce typical race T symptoms 

5 on T-corn. Pathogenicity phenotype on N-cytoplasm corn and virulence of Tox+ 
strains to T-cytoplasm corn were determined by a plant assay where about 3,000 
transformants generated using the REMI mutagenesis procedure (Lu et al., 
"Tagged Mutations at the Toxl Locus of Cochliobolus heterostrophus Using 
Restriction Enzyme-Mediated Integration," Proc. Natl Acad. Sci. USA, 

10 91 : 12649-12653 (1994), which is hereby incorporated by reference) were 

screened for mutants defective in ability to cause disease on corn plants. Two 
week old N-cytoplasm corn plants (inbred W64A) grown in the green house (5-6 
plants in one 4" X 6" pot) were inoculated with 5 ml conidial suspensions (10 5 
conidia/ml) using a pressurized Preval Spray Gun Power Unit thin layer 

1 5 chromatography sprayer (Alltech Associates, Deerfield, IL), incubated in the mist 
chamber for 24 hours (23°C) and then taken to the growth chamber (23°C, 80% 
humidity, 14 hours of light). The mutant phenotypes were determined by 
occurrence of apparent variations in disease symptom development, mainly by 
lesion size comparison. Mutants producing lesions smaller than wild type were 

20 retested and lengths of typical lesions from each mutant were compared with wild 
type 7 days after inoculation and measurements were taken for statistical 
evaluation. 

DNA manipulations and sequencing. Genomic and plasmid DNA 
preparation, restriction enzyme digestions, gel electrophoresis and gel blot 

25 analysis were done using standard protocols (Sambrook, et ah, Molecular 

Cloning: A Laboratory Manual, 2 nd Ed., Cold Spring Harbor, New York:Cold 
Spring Harbor Laboratory Press (1989), which is hereby incorporated by 
reference). DNA was sequenced at the Cornell DNA Sequencing Facility using 
TaqCycle automated sequencing with DyeDeoxy terminators (Applied 

30 Biosystems, Foster City, CA). pUCATPH was used for subcloning (Table 1). 
Primers used for sequencing (Table 2) were designed using Primer Select 
(DNASTAR Inc., LaserGene system) and synthesized by the Cornell 
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Oligonucleotide Synthesis Facility. Sequencing of each plasmid clone was 
initiated with vector-specific primers or primers designed to previously 
determined sequences. Sequences obtained were analyzed using the same system 
and nucleotide or protein database searches were performed with the BLAST 
5 program (Altschul et al, "Basic Local Alignment Search Tool " J. Mol. Biol., 
215:403-419 (1990), which is hereby incorporated by reference). 
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Table 1. Transformation vectors and clones used 



Plasmid 



Length 
(kb) a 



Characteristics 



Reference 



See Figure 29. 

Cloning vector, same as pUCATPH but Figure 8 

lacking a 420 bp Narl fragment containing 
the HintiBl site 

A clone containing pUCATPH recovered from Figure 2 
the tagged site in mutant R.C4.2696 by 
religation of Bg/II-digested genomic DNA 
As above but with Afed-digested genomic Figure 2 
DNA 

As above but with Sad-digested genomic Figure 2 
DNA 

Narl fragment derived from 214S1 Figure 4 

containing a 0.8 kb Narl-Sacl fragment of 
genomic DNA ligated to pUCl 8 
Vector for targeted integration constructed Figure 4 
by ligating ///wdlll-digested pUCATPH into 
the Hindlll site of p214SlN 

Vector for targeted integration constructed Figure 6 
by ligation of a 2.2 kb Sad fragment of 
pi 1 8B14 into the Sacl site of pUCATPH 
Vector for targeted integration constructed Figure 9 
by ligation of a 0.8 kb Sspl fragment of 
pi 18BC4 into the Sspl site of pUCATPHN 
A clone recovered from the p2 14SNP Figure 5 

integration site in transformant #1 18 by 
ligation of a fig/II-digested genomic DNA 
fragment containing the entire vector 
A clone recovered from same site as above Figure 5 
but by ligation of a 5c/I-digested genomic 
DNA fragment containing part of vector 
(214SNP) sequence 

A clone recovered from the pi 18BSP Figure 7 

integration site in transformant #9 by ligation 
of a Ps/I-digested genomic DNA fragment 
containing pUC18 

A clone recovered from the pi 1 8BCS Figure 9 
integration site in transformant #12 by 
ligation of a /fiwdlll-digested genomic DNA 
fragment containing the entire vector. 

a. An underlined kb number indicates that the plasmid carries genomic DNA 
5 sequences (see related figures for details). 



pUCATPH 5.1 

pUCATPH 4.6 
N 

p214B7 92 



p214Ml 63 

p214Sl 93 

P 214S1N 33 

p214SNP M 

pll8BSP 23 

pll8BCS SA 

pll8B14 10.4 

pll8BC4 6J 



p9P2 



73 



pl2H6 8J> 
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Table 2. Primers used for sequencing recovered genomic DNA flanking the REMI 
insertion site at the R.C4 2696 mutation. 



Name 8 



Position 6 Sequence 0 



Plasmid d Origin 6 



M 1 3RMT GCGG ATA ACAATTTC AC ACAGG A 

SEQ. ID. No. 4 

1 RPib 775 AGGCCCAGCTGCTTCTCTTG 

SEQ. ID. No. 5 

2 RP2 604 ACTCGGACCGGACGGAATAACAA 

SEQ. ID. No. 6 

3 R P3 119 CGGAAGGAGTGCGAACAA 

SEQ. ID. No. 7 

4 RP4 -232 GCTGCTTGCATCTGGTCTTG 

SEQ. ID. No. 8 

5 RP5 -812 AGACCCAGCTGTTGCCCATTG 

SEQ. ID. No. 9 

6. RP5b -1215 CGGAGACGCAAAGCCTGAGA 
SEQ. ID. No. 10 

7 RP6 -1392 TGCCAGCTGCGTCCAAGAAG 

SEQ. ID. No. 1 1 

8 RP7 -1839 GCTAGCATGGCCCTCACAC 

SEQ. ID. No. 12 
TrpC TGTGTTGACCTCCACTAGCTC 
SEQ. ID. No. 13 

9 FP1 1885 CTACGGGATGCAGAGGGAAAGT 

SEQ. ID. No. 14 
10. FPlb 1828 GCCATGATTAGCACGATACCC 
SEQ. ID. No. 15 

11 FP2 2028 CGCGCTGCATACAACTACCAA 

SEQ. ID. No. 16 

12 FP3 2490 TGGTGGCACTACAAAGAACA 

SEQ. ID. No. 17 

13 FP4 2949 CAGCGTGTTGAATGGGTAGAA 

SEQ. ID. No. 18 

14 FP4B 2745 CTGGGTAGATTCGCCTTCAC 

SEQ. ID. No. 19 

1 5. FP5 342 1 G AGCG ATC AGTGC AG A AC ATT 

SEQ. ID. No. 20 

16. FP6 3948 CGCTGACGTTTGACCATCTGA 

SEQ. ID. No. 21 
17 FP7 4411 GCATATGCAACGAGTCAAA 

SEQ. ID. No. 22 

1 8. FP8 5035 ACGGTGCACCTGTTGATA 

SEQ. ID. No. 23 

19. FP9 5457 ATGCGCACAATAGCCCAGTA 

SEQ. ID. No. 24 
20 RP48 2865 TTCAAGCAACTGTGGCGTAGG 
SEQ. ID. No. 25 

2 1 . FP1 0 5790 GATCCTAGCG ACCGCACACCAAC 

SEQ. ID. No. 26 
22 FPU 6327 CCTGCTGCTGGTGCTTCT 

SEQ. ID. No. 27 



A 


pUC18 


A 


214B7TrpC 


A 


214B7RPlb 


A 


214B7RP2 


A 


2I4B7RP3 


A 


214B7RP4 


A 


214B7RP4 


A 


2I4B7RP5 


A 


2I4B7RP6 


A 


PUCATPH 


A 


214B7TrpC 


B 


214B7TrpC 


B 


214MlFPlb 


C 


214M1FP2 


C 


214S1FP3 


C 


214S1FP4 


C 


214S1FP4 


C 


214S1FP5 


C,D 


214S1FP6 


D 


118B14FP7 




1 1 8BC4FP8 


D 


214S1FP6 


F 


9P2FP9 


F 


9P2FP10 
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23. FPUb 


6211 


GAGTTGCAAATCGTGACAGC 
SEQ. ID. No. 28 


F 


9P2FPI0 


24. FP12 


6457 


TATCAGCTGTTGTTCAATGTTCTA 


F 


9P2FP11 




SEQ. ID. No. 29 




9P2FP12 


25. FP13 


6854 


TGTTATCCCATTGCCATTG 


F 






SEQ. ID. No. 30 




9P2FP13 


26. FP14 


7400 


AAGGACGGAGATTGGTGGAG 


F 






SEQ. ID. No. 31 




9P2FP14 


27.FP15 


7771 


GGAGATGGCGGTGACGA 


F 






SEQ. ID. No. 32 




9P2FP15 


28. FP16 


8145 


GCATGGCTTGTGGAGGAC 


F 






SEQ. ID. No. 33 




9P2FP16 


29. FP17 


8492 


AGATTGTGGCTAGTATGGAGGTAA 


F 






SEQ. ID. No. 34 




pUC18 


M13F40 




GTTTTCCCAGTCACGAC 


G 






SEQ. ID. No. 35 




9P5M13F4 


30. RP1 


8953 


T ACTACTAG C ATACC AG CAT A CCT 


G 






SEQ. ID. No. 36 




9P5RP1 


31. RP2 


8559 


TCAACCTCGGAATACCAAGTC 
SEQ. ID. No. 37 


G 



a. "RP" indicates reverse primer; "FP" indicates forward primer. Primers 
designed to genomic DNA sequences are numbered in order. For stock tube and 

5 the notebook, Primers 1-17 have a leading number "214"; 18-20 with "1 18"; 21-29 
with "9P2" and 30-31 with "9P5". M13RMT(a M13R mutant version; there is a 
mutation in the polylinker of pUC18) and M13F-40 are provided by Cornell DNA 
Sequenceing Facility. TrpC primer site is in the pUCATPH TrpC promoter region 
38 bp from Sail site with sequencing direction from Sail to Kpnl. 

1 0 b. The position of the first base of each primer corresponds to the assembled 

sequence (CPS1 + TES1, total 1 1 .3 kb found in Lu's folder in the computer Yoder 
labl) 

c. Each primer sequence is given in the 5' to 3' direction 
d Plasmids used as templates for each sequencing reaction. A = p214B7; B = 
15 P 214M1; C - P 214S1; D = pi 18B14; E = pi 18BC4; F = P 9P2. G = p9P5 (=9P2) 
e. Original sequences that were used for primer design can be found in the CPS1 
sequence notebook or in Shunwen Lu's folder (CPS1 sequence) in the computer 
Yoder labl under the same names as listed. 

20 Recovery of tagged DNA from the REMI insertion site and 

targeted gene disruption. Genomic DNA of mutant R.C4.2696 was digested with 
Bgtil, Mscl (no sites in pUCATPH) or Sacl (which cuts the vector once) and 
purified by phenol extraction and ethanol precipitation, then dissolved in TE (pH 
8.0). Ligation was performed in 50 ul reaction mixture, containing 1 x T4 DNA 

25 ligase buffer with 10 mM ATP, 60 units T4 DNA ligase (New England Biolabs, 
Beverly, MA) and 3 ug of 5g/II-digested genomic DNA, at 14°C overnight. 10 ul 
of ligation mixture was used to transform 200 ul of competent DH5a cells, 
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prepared using the calcium chloride treatment (Sambrook, et al., Molecular 
Cloning: A Laboratory Manual , 2 nd Ed., Cold Spring Harbor, New York:Cold 
Spring Harbor Laboratory Press (1989), which is hereby incorporated by 
reference), to ampicillin resistance. Ampicillin resistant clones were analyzed by 
5 digestion of plasmid DNA with several diagnostic restriction enzymes and clones 
containing the REMI vector plus flanking genomic DNA were sequenced using 
the vector-specific primers (M13R or TrpQ. Three plasmids, p214B7, P 214M1 
and p214Sl (Figure 2) were recovered and used for sequencing. For targeted gene 
disruption in wild type, p214B7 was amplified and plasmid DNA purified by 
1 0 equilibrium centrifugation in CsCl-ethidium bromide gradients (Sambrook, et al., 
Molecular Clon in p: A Laboratory Manual, 2 nd Ed., Cold Spring Harbor, New 
York:Cold Spring Harbor Laboratory Press (1989), which is hereby incorporated 
by reference). 30 ug of plasmid DNA (linearized with flg/H for double crossover 
integration as illustrated in Figure 3) were used to transform wild type and the 
1 5 transformants were purified by isolation of single conidia, assayed for 
pathogenicity and characterized by gel blot analysis. 

Sequence extension by targeted integration and plasmid rescue. 
Two overlapping cosmid clones were isolated by probing a genomic DNA library 
of C4 constructed on a cosmid vector, but both extended into the left region only 
20 of p214B7. To extend to the right, a chromosome walking strategy was employed 
(Figure 3). Three targeted gene disruption experiments (each followed by plasmid 
rescue) were done successively. In the first experiment, a vector was constructed 
as follows (see Figure 4 for details): p214Sl was digested withJVarl and religated 
to create p214SlN, which was then digested with HindlU and ligated into the 
25 Hindlll site of pUCATPH to create p2 1 4SNP for transformation of race O (C5). 
One transformant (Txl 1 8) resulting from homologous integration (confirmed by 
gel blot analysis) was used for plasmid rescue as described above. Two new 
plasmids pi 18B14 and pi 18BC4 were recovered, both of which carry sequence at 
the 3' end but only 172 and 680 bp more than p214Sl, respectively (Figure 5). To 
30 continue the walk, p 1 1 8B 1 4 was digested with Sad and ligated into the Sad site 
of pUCATPH to create pi 18BSP (Figure 6). This vector was transformed into 
wild type and one plasmid, p9P2 was recovered (from transformant Tx9), which 
extends 4.4 kb into the region 3' of pi 18BC4 and contains the 3' end of CPS! 
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(Figure 7). A third experiment was done in an attempt to recover a 15 kbXhol 
fragment at the 3' end of that tagged gene, pi 18BCS (Figure 8) was constructed 
by subcloning a 0.8 kb Sspl fragment into the same site of pUCATPHN. Plasmid 
rescue using Xhol digested-genomic DNA of a transformant (TX12) failed to 
5 recover the 1 5 kb Xhol fragment, but pl2H6 was recovered using //mdlll-digested 
genomic DNA of the same transformant; the genomic DNA matched that already 
cloned on p9P2 (Figure 9). 

EXAMPLE 2 - Characterization of the REMI mutant. 

10 1 ) In all culture conditions used in the lab, mutant R.C4.2696 

grows just like wild type with no variations in growth rate, color and 
morphological features (Figure 10A). It produces normal conidia that germinate 
and form infection structures like wild type when induced on artificial surfaces 
(Figure 10B) and shows normal mating ability when crossed to wild type testers. 

1 5 No pleiotropic phenotypes associated with the mutation have been detected so far. 

2) The mutant differs from wild type in the ability to cause disease 
on corn plants. When tested on T-cytopIasm corn, the mutant produces race T 
type symptoms but the disease develops more slowly than with wild type although 
it produces wild type levels of T-toxin as detected in a microbial assay (Figure 

20 11), suggesting that the reduced virulence is not related to a deficiency in the 

ability to produce T-toxin. This is clearer on N-cytoplasm corn where the mutant 
produces lesions significantly smaller than those produced by wild type (Figure 
12). When the mutant was crossed to a wild type race O tester, the small lesion 
phenotype and ability to produce T-toxin segregated independently, indicating that 

25 mutant phenotype is not associated with the reduced fitness trait tightly linked 
with the Toxl locus (Klittich et al., "Reduced Fitness Associated With Toxl of 
Cochliobolus h*tArn*tmphus. n Phytopathology . 76:1294-1298 (1986), which is 
hereby incorporated by reference). The statistical evaluation of lesion size in the 
race O genetic background indicates that the mutation causes 60% reduction in the 

30 fungal virulence to corn plants (Figure 1 3). 

3) The mutant phenotype is caused by a tagged, single site 
mutation. In crosses between the mutant and wild type testers, progeny 
segregated 1 : 1 for parental types only and all hygromycin B-resistant progeny 
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produced lesions similar to the mutant parent; all hygromycin B-sensitive progeny 
produced wild type lesions (Figure 14), indicating that a tagged mutation is 
responsible for the reduced pathogenicity of the mutant. 

5 EXAMPLE 3 -- Cloning and sequencing of DNA flanking the REMI 

vector insertion site. 

A total of 1 1 .3 kb of genomic DNA surrounding the insertion site 
was cloned and completely sequenced (Figure 15). The sequence was derived 
from seven plasmid clones. The first three (p214B7, P 214M1 and P 214S1) were 

1 0 recovered from the tagged site in mutant R.C4.2696 and cover about 60 % (6.6 
kb) of the entire region. The rest (pi 18B14, pi 18BC4, p9P2 and p!2H6) were 
recovered from transformants generated using the chromosome walking strategy. 
DNA to the left of the insertion site (3.4 kb) was cloned on p214B7; DNA on the 
right (7.9 kb) was cloned on different overlapping plasmids. p9P2 carries the 

1 5 largest amount (4.6 kb) including genomic DNA on pl2H6 (Figure 15). 

EXAMPLE 4 - Identification ofCPSl and TES1 at the sequenced region. 

Analysis of the combined sequences revealed two open reading 
frames (ORFs). ORF 1(5.4 kb) starts 576 bp upstream of the REMI vector 

20 insertion site and ends with an in-frame stop codon (TAG) 3029 bp from the end 
of the sequenced region in the right flank (Figure 15). No "TATA" box-like 
element is found in the expected position, but five putative "CAAT" boxes are 
located upstream of the start codon (ATG), three of them are in the range found in 
most filamentous fungal promoters (60-200 bp) (Gurr et al., 1987, which is hereby 

25 incorporated by reference). Sequence around ATG of ORF 1 (CACCATGCT 
(SEQ. ID. No. 38)) is similar to the fungal consensus (CACCATGGC (SEQ. ID. 
No. 39)). Although there are several ATGs found upstream, they are less likely to 
be used as a start codon because the surrounding sequences lack similarity to the 
consensus. Three putative introns are identified by their conserved 5' and 3' 

30 border sequences and potential branch sites (Table 3). Splicing these introns 

eliminated stop codons which would otherwise interrupt the 5.4 kb open reading 
frame. Three introns have similar size (45-53 bp respectively) which is in the 
range of intron size determined from most fungal genes. A putative 
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polyadenylation signal (ATAA) is found 223 bp downstream of the translation 
termination site. The G+C content of ORF1 is 51.5%, which is similar to most 
Cochliobolus genes (Turgeon et ah, "Cloning and Analysis of the Mating Type 
Genes from Cochliobolus heterostrophus" Mol. Gen. Gene.. 238(1 -2):270-284 
5 ( 1 993); Van Wert et al "Structure of the Cochliobolus-heterostrophus 

Glyceraldehyde-3-Phosphate Dehydrogenase Gene," Cum Genet. , 22(l):29-35 
(1992); Yang et al., "A Polyketide Synthase is Required for Fungal Virulence and 
Production of the Polyketide T-Toxin," Plant CelU 8(1 1):2139-2150 (1996); Rose 
et al., "A Decarboxylase Required for Polyketide Toxin Production and High 

1 0 Virulence by Cochliobolus heterostrophus" 8 th Int. Svmp . Mol. Plant-Microbe 
Int. . Knoxville, p. J-49 (1996), which are hereby incorporated by reference). 
Interestingly, ORF1 is flanked by two regions of G+C rich DNA. The first (1 .4 
kb, 60.7% G+C) is found between ORF1 and ORF2; the second (1.2 kb, 60.3% 
G+C) is found 1.8 kb downstream of the stop codon of ORF1 (Figure 16). 

1 5 Database searches using the translated protein sequence of ORF1 revealed high 
similarity to SafB, one of the multifunctional enzymes catalyzing the biosynthesis 
of the cyclic peptide antibiotic saframycin Mxl produced by the bacterium 
Myxococcus xanthus (Pospiech et al., "Two Multifunctional Peptide Synthetases 
and an O-methyltransferase are Involved in the Biosynthesis of the DNA-Binding 

20 Antibiotic and Antitumour Agent Saframycin Mxl from Myxococcus xanthus" 
Microbiology , 142(4):74 1-746 (1996), which is hereby incorporated by 
reference). The entire nucleotide sequence of ORF1 (designated CPSI ) is given 
in Figure 17. 

Table 3. Characteristics of putative introns in CPSI and TESl 
25 



Gene 


Intron 


Size(bp) 


Location 


5'Border 


3 'Border 


Branch Site 


CPSI 


I 


45 


3060-3105 


GTAAGT 


TAG 


GTCTAAC 




II 


51 


4532-4582 


GTAAGT 


CAG 


TGCTAAC 




III 


53 


5187-5239 


GTACGT 


CAG 


TACTAAC 


TESl 


I 


49 


528-566 


GTAAGT 


TAG 


CCTTAAG 


Consensus 








GTA A /cGT 


T /CAG 


YNCTAAC* 
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* Y = Pyrimidine (T or C); N = purine or pyrimidine. 

ORF2 starts about 1.6 kb upstream of the start codon of CPS1 and 

5 is transcribed in the opposite direction (Figure 1 5). No "TATA" box-like element 
and CAAT box are found; instead, an AT-rich sequence "AAAACTAT" (SEQ. 
ID. No. 40) is located 1 1 bp upstream of the start codon ATG and a CT motif is 
found in the -30 region, which is characteristic of a number of fungal genes that 
lack a CAAT box in their promoter region (Gurr et al., "The Structure and 

10 Organization of Nuclear Genes of Filamentous Fungi," in Kinghorn, ed., Gene 
Structure in Eukarvotic Microbes . Vol. 22, published by the Society for General 
Microbiology, Oxford, England:IRL Press, pp. 93-140 (1987), which is hereby 
incorporated by reference). The sequence around ATG matches perfectly fungal 
gene consensus. A putative intron (50 bp) is found in the middle of ORF2 with 

1 5 conserved 5* and 3' border sequences and a potential branch site (Table 3). A 
putative polyadenylation signal (AAATA) is found 189 bp downstream of the 
translation stop codon TGA. The G+C content of ORF2 is 55.5%, which is 
slightly higher than the normal range because the 5' end of ORF2 is located in the 
region of G+C rich DNA upstream of ORF1 (Figure 16). Database search 

20 revealed that ORF2 encodes a protein with high similarity to Homo sapiens 
thioesterase II (hTE, Liu et al., "Binding of HIV-1 Nef to a Novel Thioesterase 
Enzyme Correlates with Nef-Mediated CD4 Down-Regulation," J. Biol. Chem., 
272(21:13779-13785 (1997), which is hereby incorporated by reference) and E. 
coli thioesterase II encoded by the tesB gene (Naggert et al., "Cloning, 

25 Sequencing and Characterization of Escherichia-coli Thioesterase II," J- Biol. 
Chem. . 266(17)11044-11050 (1991), which is hereby incorporated by reference). 
The nucleotide sequence of ORF2 (designated TES1) is given in Figure 18. 

EXAMPLE 5 - Modular structure ofCPSl. 

30 Predicted CPS1 protein (1743 amino acids, M r 193235) contains 

two structurally similar modules, both of which are similar to SafBl, the first 
module of saframycin synthetase B (overall 25% identity; 50% similarity) and 
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have apparent amino-acid-activating and thiolation domains but lack 
methyitransferase activity, thus appearing to be typical type I modules (Figure 
19). The number of amino acids in each module is different: the first module 
(CPS1A) consists of 574 amino acids (from the first residue of core 1 to the last 

5 residue of core 6), which is larger than most type I modules; the second module 
(CPS IB) has 530 amino acids, which is average. The distance between the two 
modules is 193 amino acids, much shorter than most peptide synthetases (500-600 
aa), but this distance is not highly conserved, i.e., an opposite variation is found in 
HC-toxin synthetase and cyclosporine synthetase, both of which have about 1,000 

1 0 aa between the first and second amino-acid-activating module (Figure 20F). 

Amino acid alignment of the two modules of CPS 1 to SafBl 
indicated that these modules are highly similar to each other in both overall amino 
acid composition and conserved motif sequences as defined by Stachelhaus and 
Marahiel (Stachelhaus et al, "Modular Structure of Peptide Synthetases Revealed 

1 5 by Dissection of the Multifunctional Enzyme Grs A," J. Biol. Chem., 

270(1 1):6 163-6 169 (1995); Marahiel, "Protein Templates for the Biosynthesis of 
Peptide Antibiotics," Chem. Biol. . 4(8):561-567 (1997), which are hereby 
incorporated by reference). When aligned to other bacterial or fungal peptide 
synthetases, CPS1 only showed local similarity to cyclosporine synthetase (Weber 

20 et al., "The Peptide Synthetase Catalyzing Cyclosporine Production in 

Tolypocladium niveum is Encoded by a Giant 45.8-Kilobase Open Reading 
Frame," Current Genetics, 26(2): 120-125 (1994), which is hereby incorporated by 
reference) and tyrocidine synthetase A (Mootz et al., "The Tyrocidine 
Biosynthesis Operon of Acillus brevis: Complete Nucleotide Sequence and 

25 Biochemical Characterization of Functional Internal Adenylation Domains," J. 
BacterioL 179(21):6843-6850 (1997), which is hereby incorporated by reference), 
but when the amino acids in motif regions were aligned, a overall conservation 
was observed. Both CPS1A and CPS IB have all five core sequences in the 
amino-acid-activating domain (Figure 20A-E). Cores 3 and 4 are well conserved 

30 except for the replacement of an aspartic acid residue of core 4 by a leucine in 
CPS1A. Cores 1, 2 and 5 show weak conservation, but similar variations are also 
seen in SafBl . A thiolation domain is found in both modules, which contains a 
highly conserved motif (core 6, Figure 20F). The serine residue in this motif has 
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been shown to be the active site for 4'-phosphopantetheine attachment 
(Schlumbohm et al, "An Active Serine is Involved in Covalent Substrate Amino 
Acid Binding at Each Reaction Center of Gramicidin S Synthetase," J- Biol. 
Chem. , 266(34):23135-23141 (1991); Stein et al., "Detection of 4'- 

5 Phosphopantetheine at the Thioester Binding Site for L-Valine of GramicidinS 
Synthetase 2," FEBS Lett. . 340(1 -2):39-44 (1994), which are hereby incorporated 
by reference). The distances between the six core sequences in the two modules 
are also largely conserved. Two exceptions are found in the first module, which 
has 312 aa between cores 2 and 3, larger than normal (150-200); 61 between cores 

10 5 and 6, only half of that of most peptide synthetases. SafBl also shows distance 
variations at these two interval regions (Figure 20B and E). In addition to amino- 
acid-activating and thiolation domains, CPS1 also has an integrated thioesterase 
domain (TE) in the carboxy-terminal end of CPS1B (Figure 19). A signature 
sequence GXSXG, which is highly conserved in animal fatty acid thioesterase 

1 5 type II enzymes and several peptide synthetases, is found in this domain (Figure 
21). 

EXAMPLE 6 - Sequence homology analysis of TES1 protein. 

The predicted TES1 protein consists of 367 amino acids (M r 

20 41013). Amino acid alignment of TES1 to hTE, TESB and Mycobacterium 

tuberculosis TESB homolog (Philipp et al., "An Integrated Map of the Genome of 
the Tubercle bacillus, Mycobacterium tuberculosis H37Rv, and Comparison with 
Mycobacterium leprae," Natl Acad. Sci. USA. 93(7):3 132-3 137 (1996), 
which is hereby incorporated by reference) showed that these proteins have an 

25 overall 40% identity and 60% similarity. A highly conserved VHS motif (putative 
active site) is found in the C-terminal region of TES1 at a conserved position 
(Figure 22). All these thioesterases have no sequence similarity with the 
previously identified animal type I or type II thioesterases known to be involved 
in the chain termination of fatty acid synthesis (Naggert et al., "Cloning, 

30 Sequencing and Characterization of Escherichia-coli Thioesterase II," J. Biol. 

Chem. . 266(17):11044-1 1050 (1991), which is hereby incorporated by reference). 
Interestingly, TES1 has more homology to hTE than to two bacterial genes, 
suggesting that both proteins belong to a new family of eucaryotic thioesterases. 
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EXAMPLE 7 - Targeted disruption of CPSL 

Disruption of either CPS1A or CPS1B restored the original mutant 
phenotype. Ten transformants from each of four individual disruption 

5 experiments using different constructs, including the plasmid recovered from the 
REMI insertion site in the mutant (p214B7) and three vectors for chromosome 
walking (p214SNP, pi 18BSP and pi 18BCS) were purified and assayed onN- 
cytoplasm corn. All transformants showed the same small lesion phenotype as 
that of the original REMI mutant. Gel blot analysis confirmed that all 

10 transformants showing the mutant phenotype resulted from homologous 

integration of the transforming vector that disrupted the wild type CPS1 (Figures 
24-26). No transformants showing the wild type phenotype were obtained, 
presumably because of the large genomic DNA fragments (over 800 bp in all 
disruption experiments) on the transforming vector that resulted from high 

1 5 efficiency of homologous recombination and the low chance to recover 
transformants with ectopic integration. 

EXAMPLE 8 - Methods and Materials for Examples 9-10 

Strains, growth conditions and transformation. Strains of 

20 Cochliobolus species and relatives used for genomic DNA hybridization are listed 
in Table 4. The strain HvW, a victorin-producing isolate of C victoriae was 
recovered from storage and grown on CMX medium (Turgeon et al., 
"Transformation of the Fungal Maize Pathogen Cochliobolus heterostrophus 
Using the Aspergillus nidulans amdS Gene," Mol. Gen. Genet. , 201 :450-453 

25 (1985), which is hereby incorporated by reference) for conidiation or on oat meal 
agar medium (Churchill et al., "Victorin-Deficient REMI Mutants of Cochliobolus 
victoriae Demonstrate a Requirement for Victorin in Pathogenesis," Fungal 
Genet. NewsL 42A:41 (1995), which is hereby incorporated by reference) for 
victorin detection at 24°C under warm white lights (Sylvania Inc., Danvers, MA). 

30 Transformation was done using the C heterostrophus procedure (Turgeon et al., 
"Cloning and Analysis of the Mating Type Genes from Cochliobolus 
heterostrophus;' Mol. Gen. Gene.. 238(l-2):270-284 (1993), which is hereby 
incorporated by reference). 
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Table 4. Detection of CPS! homologs in Cochliobolus spp and relatives 



Strain 3 


Host b 


EcoRl 
digest c 


Hybridization 
HindLll ■ 
digest d 


Bgia 

digest 6 


C. heterostrophus 
raceTfC^ 
race 0 (C5) 


Corn 
(Turf-13) 


+ 
+ 


5.2 3.2 
5.2 3.2 


4.2 
4.2 


C. carbonum 
race 1 f26R13) 
race 2 (YugY) 
race 3 (BZ 1703)* 


Corn* 
(hmlhml) 


+ 

XT 

N 
N 


6.6 
o.o 
6.6 


5.0 
5.0 


C. victoriae fHvW) 


OdXs{Vb) 


+ 


N 


J.U 


C sativus (A20) 


Grasses 2 


+ 


3.0 


N 


C s/?ec//er (D5-7) 


Grasses 2 


+ 


N 


N 


C. homomorphus 
(ATCC 13409) 


Unknown 


N 


5.8 


N 


C. dactyloctenii 
(7938-9) 


Unknown 


N 


5.9 


N 


5. furc/ca (NK2) 


Sorghum and 
maize 3 


+ 


N 


N 


5. ratfra/a (32197) 


Weeds and 
bamboo 4 


+ 


2.8 


N 


5. sacchari 
(764-1) 
(1249-10) 


Sugarcane 5 


+ 
N 


5.4 2.5 
5.4 2.5 


N 
N 



a. C. = Cochliobolus. S. = Setosphaeha. B. = Bioplaris. The name of isolates (or 
5 lab strains) of each species are given in parentheses and those known to produce 

host-specific toxins are underlined. * Provided by Tsukiboshi Takao (Japan) and 
the isolate could be either BZ1209 or BZ1 703. 

b. Genotype susceptible to the host-specific toxin-producing isolate is given in 
parentheses. References for hosts of those species not mentioned in the previous 

1 0 chapters are as follows: 1 : Welz et al., "Phenotypic Variation and Parasitic 
Fitness of Races of Cochliobolus-carbonum on Corn in North Carolina," 
Phytopathology. 83(6)593-601 (1993); Leonard et al., "Genetic Diversity in Field 
Populations of Cochliobolus-carbonum on Corn in North Carolina USA," 
Phytopathology. 80(1 1):1 154-1 159 (1990) (for races 2 and 3 only). 2: Domsch et 

1 5 al., "Compendium of Soil Fungi, Vol. 1," New York, New York:Academic Press, 
pp. 2 1 6-222 (1980). 3 : David et al., "Fungi on Plants and Plant Products," St. , 
Paul, Minnesota: APS Press, p. 635 (1989); Thakur et al., "Characterization of a 
New Race of Exserohilum-turcicum Virulent on Com With Resistance Gene 
HTN," Plant Pis. . 73(2): 15 1-1 55 (1989). 4: Rao et al., "New Fungal Diseases on 

20 Some Weeds," Indian Bot. Rep. . 6(1):38 (1987); Bhat et al., "Unrecorded 
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Pathogen on Bamboo Causing Blight in India," Curr. SCI. (BANGALORE), 
58(20): 1 148-1 149 (1989). 5: Yoder, "Toxins in Pathogenesis," Ann. Rev. 
PhytooathoL 18:103-129 (1980). 

c Genomic DNAs (from a previously prepared gel blot filter, Rose etal., A 
5 Decarboxylase Required for Polyketide Toxin Production and High Virulence by 
Cochliobolus heterostrophus," 8 th Tnt. Svmo. Mol. Plant-Microbe Int., Knoxville, 
p. J-49 (1996) were probed with the 3.4 kb CPSl fragment cloned on p214B7 
(Figure 2). "+" indicates a strong hybridization signal. All species hybridized to a 
large fragment (~23 kb). 
1 0 d Genomic DNAs selected from a lab collection were probed with the CPSl 3.2 
kb fragment cloned on P 214S1 (Figure 2). The size of fragments that hybridized 
to the probe is given in kb. The intensities of hybridization signals were similar to 
each other. N = not done. . 
e. Genomic DNAs were probed with the same CPSl fragment as in c. Gel blot is 

1 5 shown in Figure 26. 

DNA manipulations and targeted disruption of the CPSl 
homolog in C. victoriae. Genomic DNAs for probing were prepared according to 
Yoder ("Cochliobolus heterostrophus, Cause of Southern Corn Leaf Blight," in 
20 Sidhn, ed . Genetics of Plant Pathogenic Fungi . Vol. 6, San Diego, 

California:Academic Press, pp. 93-1 12 (1988), which is hereby incorporated by 
reference), or selected from a lab DNA collection (stored at 4°C). A gel blot filter 
bearing known genomic DNAs was also probed. Plasmid DNA preparation, 
restriction enzyme digestions, gel electrophoresis, gel blot analysis were done 
25 using standard protocols (Sambrook, et al., Molecular Cloninp: A Laboratory 
Manual, 2 nd Ed., Cold Spring Harbor, New York:Cold Spring Harbor Laboratory 
Press (1989), which is hereby incorporated by reference). For probing, CPSl 
fragments of C. heterostrophus cloned on p214B7 (3.4 kb left flank) and p214Sl 
(3.2 kb right flank) (Figure 2) were prepared by restriction enzyme digestion of 
30 the plasmid DNAs followed by purification using the QIAquick Gel Extraction 
Kit (QIAGEN Inc., Chatsworth, CA). The plasmid pi 1 8B14, which carries the 
2.2 kb Bgl II fragment of CPSl interrupted by the hygB cassette (see Figure 5) 
was linearized with BglU. and introduced into HvW genome. Transformants were 
purified by isolation of single conidia and genomic DNAs were digested with 
35 flg/II and probed with the CPSl 3 .2 kb fragment. 

Bioassays. Pathogenicity was determined by an oat plant assay. 
Fungal strains were grown on individual oat meal agar medium plates (60 X 1 5 
mm) containing hygromycin B (60 ug/ml) for 10 days at 24°C under lights. 
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Conidia were scraped from the plates and suspended in 6 ml of sterilized distilled 
water. One ml of conidial suspension of each strain was mixed with 60 seeds of 
susceptible or resistant oats. Inoculated seeds were planted in 4" X 6" pots and 
seedlings were allowed to grow for two weeks. Seed germination rate and 
5 symptom development were recorded at different stages (4, 6, 8 and 24 days after 
inoculation). Detection of victorin production using HPLC analysis was done by 
Alice Churchill in Dr. Vladimir Macko's lab at Boyce Thompson Institute for 
Plant Research. 

1 0 EXAMPLE 9 » Detection of CPS1 homologs. 

Genomic DNAs of 12 isolates (or lab strains) of 9 fungal species 
hybridized to CPS1 (Table 4). All 6 Cochliobolus species, including 4 known 
plant pathogens (C carbonum, C victoriae, C. sativus and C. spicifer) and 2 
species with unknown hosts (C. homomorphus and C. dactyloctenii) gave 

1 5 hybridization signals of the same intensity as that of C. heterostrophus CPS1 
fragments (Figure 26, only C. carbonum and C. victoriae are shown). Two 
phytopathogenic Setosphaeria species and Bioplaris sacchari, a sugarcane 
pathogen gave a similar hybridization intensity. 

CPS1 homologs appear to be polymorphic among different species, 

20 i.e., all species gave one or two unique bands when BgRl or HindBl digested 
genomic DNAs were probed (except for C. victoriae, which showed the same 
hybridization pattern as C. carbonum) (Table 4 and Figure 26). Interestingly, 
EcoKl digested genomic DNAs of the same species did not show polymorphisms; 
all species hybridized to a large fragment (-23 kb, Table 4), indicating the absence 

25 of an £coRI site in all CPS1 homologs as in the C. heterostrophus gene. In C 
heterostrophus, a > 12 kb of genomic region which includes CPS I (5.4 kb), TES1 
(1 .1 kb) and sequence downstream of the 3* end of CPS1 has no EcoRl sites. In 
contrast to species-dependent polymorphisms, CPS1 homologs appear to be 
highly conserved among different isolates of the same species. Both C 

30 heterostrophus race T and race O hybridized to the same 4.2 kb Bglil fragment (or 
5.2 and 3.2 kb M/idHI fragments); all three C carbonum races hybridized to the 
same 5.0 kb Bgtll fragment (or 6.6 kb HindlU fragment) (Table 4, Figure 26) and 
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B. sacchari isolates 764-1 and 1249-10 hybridized to the same Hindlll fragments 
(5.4 and 2.5 kb) (Table 4). 

EXAMPLE 10 Targeted disruption of CPS 1 homolog in C. victoriae. 
5 Twenty transformants were obtained from transformation of the 

victorin-producing isolate HvW with flg/IHinearized plasmid pi 18B14 (Figure 
5). Six transformants were purified and assayed for both victorin production and 
pathogenicity to susceptible oat plants. All transformants produced wild type 
levels of victorin as determined by HPLC analysis (Figure 27), but four of them 

10 (Tx7, Tx2, Tx5 and Tx8) showed dramatically reduced virulence in the plant 
assay. The seed germination rate on the eighth day after inoculation is only 13- 
25% for wild type and two transformants (Tx9 and Tx4), but 45-63% for the other 
four transformants. On day 24 after inoculation, all plants emerged from the seeds 
inoculated with wild type, Tx9 or Tx4 were killed but most (29-63%) from the 

1 5 seeds inoculated with Tx2, Tx7, Tx5 or Tx8 still survived (Table 5, Figure 28 A). 
Gel blot analysis confirmed that transformants showing the reduced virulence 
phenotype resulted from homologous integration of the transforming vector that 
disrupted the wild.type CPS1 homolog in C victoriae genome; transformants 
showing the wild type phenotype resulted from ectopic integration events that left 

20 the native gene intact (Figure 28B). All transformants remained nonpathogenic to 
resistant oats, indicating that disruption of the CPS1 homolog does not affect host 
specificity of the fungus. 

Table 5. Disease development of oat plants inoculated with C. victoriae 
25 transformants (Tx). 





No. germinated 1 * 


Germination 


No. survivors 


Strain* 


4 


6 


8 


Rate (%) c 


24 % 


Control- 1 


28 


41 


45 


75 


75 100 


Control-2 


40 


50 


50 


83 


50 100 


Control-3 


1 


7 


12 


20 


0 0 


Tx2 


8 


26 


27 


45 


16 59 


Tx4 


5 


15 


15 


25 


0 0 
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Tx5 


2 


24 


28 


47 


8 


29 


Tx7 


14 


36 


38 


63 


24 


63 


Tx8 


7 


29 


29 


47 


13 


47 


Tx9 


0 


3 


8 


13 


0 


0 



a. Control- 1 = uninoculated susceptible oat seeds. Control-2 and Control-3 = 
resistant and susceptible oat seeds inoculated with wild type C victoriae (isolate 
HvW), respectively. Six transformants were tested on both resistant and 

5 susceptible seeds, but only data for the later are shown (all transformants gave the 
same results as Control-2 when tested on resistant seeds). Repeat experiments 
gave similar results (data not shown). 

b. Sixty oat seeds were used for each strain. Emerged oat plants were counted 4, 
6 and 8 days after inoculation. 

10 c. Calculation based on the data collected on the day 8. 

d. Recorded on day 24 after inoculation. The percentage of survivors is based on 
the number of plants recorded on days 8 and 24. 

EXAMPLE 11 - Isolation ofCPSl genes from other plant pathogens 
1 5 As dislosed in the previous examples, the Cochliobolus 

heterostrophus gene CPS1 encodes a peptide synthetase that appears to be a 
general factor for fungal virulence to their hosts. Thus, CPS1 has been found to 
be highly conserved among at least 9 fungal species belonging to 3 genera 
including the genus Cochliobolus and closely related genera Bioplaris and 
20 Setosphaeria; it has been demonstrated to be required for pathogenesis of three 
different plant pathogens i.e., C heterostrophus race O , race T to corn and C 
victoriae to oats (Lu, 1998, "Molecular-genetic analysis of general and specific 
pathogenesis factors in Cochliobolus heterostrophus" Ph.D thesis, Cornell 
University). 

25 To further explore the role of CPS1 in fungal pathogenesis and its 

conservation in other fungi, genomic DNAs of additional species in Cochliobolus 
and other closely or distantly related genera were probed with ChCPSl by DNA- 
DNA hybridization (Lu, S.-W., B. G. Turgeon and 0. C. Yoder. 1999. "A gene 
cluster from the corn pathogen Cochliobolus heterostrophus required for 

30 nonribosomal peptide biosynthesis and general virulence of fungi." Fungal 

Genetics Conference, March 1999, Pacific Grove, California). Genomic DNAs 
of 40 filed isolates (or lab strains) representing 34 fungal species belonging to 16 
genera hybridized when probed with ChCPSl (Figs. 30A-30C). 
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Fungal genomic DNAs were prepared according to a previously 
described procedure (Yoder, 1988. "Cochliobolus heterostrophus, cause of 
Southern Com Leaf Blight". Genetics of Plant Pathogenic Fungi. G. S. Sidhu. San 
Diego, Academic Press. 6: 93-1 12). Plasmid DNA preparations, restriction 
5 enzyme digestions and preparation of DNA gel blots were performed following 
standard protocols (Sambrook, J., E. F. Fritsch, et al. 1989 "Molecular Cloning: A 
Laboratory Manual, 2nd Edition". Cold Spring Harbor, NY, Cold Spring Harbor 
Laboratory Press). A 3.2 kb ChCPSl fragment (corresponding to ChCPS 1 amino 
acids 173-1208) was obtained by restriction enzyme digestion of a plasmid clone 
10 P 214S1 (Lu, 1998, Ph.D thesis), (see Figure 2), followed by purification using the 
QIAquick Gel Extraction Kit (QIAGEN Inc., Chatsworth, CA). The purified 
CfcCPSyfragment was labeled with a-[ 32 P]dCTP (Turgeon, B. G., H. Bohlmann, 
et al, 1993. "Cloning and analysis of the mating type genes from Cochliobolus 
heterostrophus." Mol. Gen. Genet. 238: 270-284.). DNA-DNA hybridization 
15 was carried out at 62° C in 6 X SSC, 0.05 X BLOTTO (Sambrook et al., 1989). 
Filters were washed in 2 X SSC, 0.1% SDS at 62° C for 60 minutes. 

All 16 Cochliobolus species, including the known plant pathogens 
C. carbonum, C. victoriae, C. miyabeanus, C. sativus and C. spicifer, and five 
genera closely related to Cochliobolus, i.e., Pyrenophora, Setosphaeria, Bipolaris, 
20 Stemphyllium and Alternaria showed hybridization intensities comparable to that 
of C. heterostrophus itself (Fig. 30A). 

DNAs of species from nine distantly related genera, including 
several of economic importance (e. g., Magnaporthe grisea, Fusarium 
graminearum, Gaeumannomyces graminis) or of medical importance (e. g., 
25 Candida albicans) hybridized weakly to CPS1 (FigS. 30B, 30C) whereas no 
signal was detected in DNA of the basidiomycete Ustilago maydis. 

Three CPS1 homolog genes were cloned and characterized. 
Three of them were cloned from phytopathogenic fungi, including the wheat 
head scab fungus Fusarium graminearum {FgCPSl, 6003 bp, SEQ. No. 43), 
30 the potato early blight fungus Alternaria solani, (AsCPSl , 2369bp, SEQ. No. 
41) and the barley net blotch fungus Pyrenophora teres (PtCPSl, 2306 bp, 
SEQ. No. 45). FgCPSl was cloned as a full length gene using both PCR 
amplification and the plasmid rescue procedure that was preceded by targeted 
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gene disruption in the genome. AcCPSl and PtCPSl homologs were partially 
cloned by direct PCR amplification. 

The Polymerase Chain Reaction (PCR) was carried out using 
degenerate primers designed to conserved regions of C heterostrophus CPSI 
5 (ChCPSl). Two sets of degenerate primers were designed to amino acids at or 
close to conserved core sequences of C. heterostrophus CPS! (ChCPSl). The 
first pair of primers: 5 ' TG YTT Y ATHGCNGGN GTNGTNGCNGTNCC3 ' 
(CHFP6, corresponding to positions 493-521 of ChCPSl) and 
5 ' YTG YTGNGGNGGNCCNCCNGGRTT3 ' (CHRP4, 2197-2220 of 

10 ChCPSl), was used to amplify CPSI from Fusarium graminearum. The 

second pair of primers: 5'-AARAARAARGGNCCNACNGAG-3' (FP4CB, 
corresponding to positions 1531-1550 of ChCPSl) and 5'SRYTGNA 
CCCAD ATYTCNCC3 5 (RP2DB, corresponding to positions 3883-3902 of 
ChCPSl), was used to amplify CPSI from A. solani and Pyrenophora teres. 

1 5 PCR was carried out in a Perkin Elmer Cetus 9600-thermocycler with fungal 
{Fusarium graminearium, Alternaria solani, and Pyreophora teres) genomic 
DNA as a template. 

Reaction mixtures contained about 500 ng of genomic DNA in 100 ul 
of reaction buffer [lx£x Taq buffer, 0.2 mM dNTPs, 0.2 uM of each primer 

20 and 0.05 U/ml Takara Ex Taq (Pan Vera Corporation)]. An initial denaturing 
step of 95 °C for 3 min. was followed by 30 cycles of 94 °C for 1 min, 47 °C 
(for G. zeae) or 55 °C (for A. solni and Pyrenophora teres) for 3 min, and 72°C 
for 3 min. Reactions were cooled to 4 °C after a final extension of 72 °C for 10 
min. PCR products were examined (10 ul of each reaction) by agarose-gel 

25 (0.75%) electrophoresis. 

DNA from isolated clones of the three different fungi was 
sequenced at the Cornell DNA Sequencing Facility using TaqCycle automated 
sequencing with DyeDeoxy terminators (Applied Biosystems, Foster City, 
CA). Primers used for sequencing were designed using Primer Select 

30 (DNASTAR) and synthesized by the Cornell Oligonucleotide Synthesis 

Facility. Sequencing of each plasmid clone was initiated with vector-specific 
primers or primers designed to previously determined sequences. Sequences 
were analyzed using MapDraw and MegAlign (DNASTAR) and nucleotide or 
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protein database searches were performed with the BLAST program (Altschul 
et al., 1990, 1997). 

The FgCPSl open reading frame (5125 bp, SEQ. No. 43) has 
50% nucleotide identity to ChCPSl (SEQ ID No.2) in about 4.4 kb of overlap 

5 (Fig. 2). No "TATA" box-like element was found in the 5' untranslated region, 
but other promoter sequences including two putative "CAAT" boxes and a 
"CT" motif were located upstream of the start codon (ATG) in FgCPSl (Fig. 
32). Only one putative intron was found 1508 bp upstream the stop codon 
(TGA) in contrast to three in ChCPSl (Figs. 31 A, 3 IB and 32). A putative 

10 polyadenylation signal "AATAA" was located 62 bp downstream of the stop 
codon (Fig. 32). The predicted FgCPSl protein (1692 amino acids, M v 187983 
Da, SEQ ID No. 44) has 68% identity, 73% similarity to ChCPSl (SEQ ID No. 
3) in a about 1 ,500 amino acid overlap (Figs. 3 1 A and 3 IB) that contains two 
structurally similar modules highly similar to those of ChCPS 1 . FgCPS 1 has 

1 5 no significant similarity to ChCPS 1 at the C-terminus, which is relatively 
shorter and lacks the thioeterase domain as seen in ChCPSl (Figs. 31 A and 
3 IB). The annotated FgCPSl sequence is given in Figure 32. 

AsCPSl (2369 bp, SEQ. No. 41) has 76% nucleotide identity to 
ChCPSl (SEQ ID No.2) in the entire cloned region which contains two conserved 

20 introns (Figs. 31 A and B). The translated AsCPSl protein (partial) includes 758 
amino acids corresponding to amino acids 51 1-1269 in ChCPSl and has up to 
93% identity, 95% similarity to ChCPSl (Fig. 2). The annotated AsCPSl 
sequence is given in Figure 33. 

PtCPSl (2306 bp, SEQ. No. 45) has 78% nucleotide identity to 

25 ChCPSl (SEQ IDNo.2) in the entire cloned region which contains only one 

intron (Fig. 2), The translated PtCPSl protein (partial) includes 758 amino acids 
corresponding to amino acids 51 1-1269 in ChCPSl and has 93% identity, 96% 
similarity to ChCPSl (Fig. 2). The annotated PtCPSl seqence is given in Figure 
34. 

30 

EXAMPLE 12 - Targeted disruption of CPS 1 homolog in F. graminearunu 

A 2.2 kb Xbal fragment from pUCATPH (Lu, et al, 1994) 
containing the bacterial hygmycin resistance gene (hygB) driven by the 
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Aspergillus nidulans trpC promoter was inserted into the Xbal site of a PCR clone 
pFgC8, which carries the 1.0 kb internal fragment of FgCPSl, to create P FgC8- 
hygB. This construct was transformed (in a circular form or linearized with 
/Mil or 5g/II) into an isolate (GZ3639) of wild type F. graminearum. Twenty 
5 transformants were obtained by transformation of a wild type isolate (GZ3696) 
with circular plasmid pFgC8-hygB. DNA gel blot analysis of eight such 
transformants confirmed that the CPS1 homolog in F. graminearum was disrupted 
by a single cross over recombination (Fig. 35A). Two transformants were 
obtained when the Bg/II-digested pFgC8-hygB was used and only one recovered 
10 when the plasmid was digested with Hindlll. Gel blot analysis indicated that all 
three transformants obtained using linearized plasmid integrated into the genome 
at a ectopic location that left the wild type FgCPSl gene intact (Fig. 35A). For 
virulence assays, F. graminearum strains were grown on PDA (or PDA plus 
hygromycin B for transformants) plates (100 X 15 mm) for 7-15 days at 24 C 
1 5 under black lights (Sylvania Inc., Danvers, MA) for maximum conidiation. A 
susceptible spring wheat cultivar, Norm Hard Red (kindly provided by G. 
Bergstrom, Cornell university) was used. Two months old wheat plants grown in 
the greenhouse at anthesis (10 plants in one 4" X 6" pot) were sprayed with 10 ml 
conidial suspensions (10* or 10* conidia/ml) using a pressurized Preval Spray 
20 Gun Power Unit thin layer chromatography sprayer (Alltech Associates, 
Deerfield, IL) or "injected" into the spikelets (5-10 ul/ per spikelet) using a 
Benchmate pipettor (NICHIRYO, Japan). The inoculated plants were incubated in 
a mist chamber for 48 hours (23° C) and then transferred to a growth chamber (23° 
C, 14 hours of light). Mutant phenotypes were identified by the comparison of the 
25 number of infected or "bleached" spikelets on each head of wheat plants. 

Symptoms were recorded 7-10 days after inoculation. All cpsl' disruptants had 
reduced virulence on wheat plants in the assays while ectopic transformants 
caused disease symptoms indistinguishable from those of wild type (Fig. 35B). 
Although the invention has been described in detail for the 
30 purposes of illustration, it is understood that such detail is solely for that purpose, 
and variations can be made therein by those skilled in the art without departing 
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from the spirit and scope of the invention which is defined by the following 
claims. 
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WHAT IS CLAIMED IS: 

1 . An isolated nucleic acid molecule from a plant pathogen encoding 
a CPS1 synthetase, wherein the nucleic acid molecule comprises a nucleotide 
5 sequence which hybridizes under stringent conditions to a nucleic acid molecule 
having a sequence as set forth in SEQ ID No. 41 or a complement thereof; or 
wherein the nucleic acid molecule comprises a nucleotide sequence as set forth in 
SEQ ID No. 41. 

1 0 2. The isolated nucleic acid molecule according to claim 1 wherein 

the plant pathogen is selected from the group consisting of Alternaria solani, 
Alternaria Alternaia alternatherae, A. alternata, A, amaranthi, A. araliae, A. 
brassicae, A. brassicicola, A. camelliae, A. cassiae, A. cheiranthi, A. cinerariae, 
A. gossypii, A. helianthi, A, helianthinficiens, A. mail, and A raphanl 

15 

3. The isolated nucleic acid molecule according to claim 2 wherein 
the plant pathogen is Alternaria solani. 

4. An isolated nucleic acid molecule coding for a polypeptide having 
20 the amino acid sequence as„set forth in SEQ ID No. 42. 

5. An isolated nucleic acid molecule comprising the nucleotide 
sequence as set forth in SEQ ID No. 41. 

25 6. An isolated polypeptide encoded by the nucleic acid molecule of 

claim 1. 

7. The polypeptide according to claim 6, wherein the polypeptide has 
an amino acid sequence which has at least 75 % similarity to the amino acid 
30 sequence of SEQ. ID. No. 42 as determined by a BLAST program with the default 
parameters. 
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WHAT IS CLAIMED IS: 

1 . An isolated nucleic acid molecule from a plant pathogen encoding 
a CPS1 synthetase, wherein the nucleic acid molecule comprises a nucleotide 
5 sequence which hybridizes under stringent conditions to a nucleic acid molecule 
having a sequence as set forth in SEQ ID No. 41 or a complement thereof; or 
wherein the nucleic acid molecule comprises a nucleotide sequence as set forth in 
SEQ ID No. 41. 

1 o 2. The isolated nucleic acid molecule according to claim 1 wherein 

the plant pathogen is selected from the group consisting of Alternaria solani, 
Altemaria Alternaia alternatherae, A. alternate*, A. amaranthi, A. araliae, A. 
brassicae, A. brassicicola, A, camelliae, A. cassiae, A. cheiranthi, A. cinerariae, 
A. gossypii, A. helianthi, A. helianthinficiens, A. mail, and A. raphanl 

15 

3. The isolated nucleic acid molecule according to claim 2 wherein 
the plant pathogen is Alternaria solani. 

4. An isolated nucleic acid molecule coding for a polypeptide having 
20 the amino acid sequence as^set forth in SEQ ID No. 42. 

5. An isolated nucleic acid molecule comprising the nucleotide 
sequence as set forth in SEQ ID No. 41 . 

25 6. An isolated polypeptide encoded by the nucleic acid molecule of 

claim 1. 

7. The polypeptide according to claim 6, wherein the polypeptide has 
an amino acid sequence which has at least 75 % similarity to the amino acid 
30 sequence of SEQ. ID. No. 42 as determined by a BLAST program with the default 
parameters. 
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8. The polypeptide according to claim 7 wherein the polypeptide 
comprises an amino acid sequence as set forth in SEQ ID No. 42. 

9. An isolated nucleic acid molecule from a plant pathogen encoding 
5 a CPS1 synthetase, wherein the nucleic acid molecule comprises a nucleotide 

sequence which hybridizes under stringent conditions to a nucleic acid molecule 
having a sequence as set forth in SEQ ID No. 43 or a complement thereof; or 
wherein the nucleic acid molecule comprises a nucleotide sequence as set forth in 
SEQ ID No. 43. 

10 

10. The isolated nucleic acid molecule according to claim 1 wherein 
the plant pathogen is selected from the group consisting of Fusariumi 
graminearium, Fusarium avenaceum, F. carpineum, F. chlamydosporum, F. 
coccophilum, F. culmorum, F. episphaeria, F. equiseti, F flocciferum, F. 

15 moniliforme, F. oxysporum, F. redolens, F. sambucinum, F. solani, F, 
subglutinans, F. tricholhecioides, F. udum, or F. ventricosum. 

11. The isolated nucleic acid molecule according to claim 10 wherein 
the plant pathogen is Fusarium graminearium. 

20 

12. An isolated nucleic acid molecule coding for a polypeptide having 
the amino acid sequence as set forth in SEQ ID No. 44. 

13. An isolated nucleic acid molecule comprising the nucleotide 
25 sequence as set forth in SEQ ID No. 43. 

14. An isolated polypeptide encoded by the nucleic acid of claim 9. 

15. The polypeptide according to claim 14, wherein the polypeptide 
30 has an amino acid sequence which has at least 75 % similarity to the amino acid 

sequence of SEQ. ID. No. 44 as determined by a BLAST program with the default 
parameters. 
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16. The polypeptide of claim 1 5 wherein the polypeptide comprises the 
amino acid sequence as set forth in SEQ ID No. 44. 

17. An isolated nucleic acid molecule from a plant pathogen encoding 
5 a CPS 1 synthetase, wherein the nucleic acid molecule comprises a nucleotide 

sequence which hybridizes under stringent conditions to a nucleic acid molecule 
having a sequence as set forth in SEQ ID No. 45 or a complement thereof; or 
wherein the nucleic acid molecule comprises a nucleotide sequence as set forth in 
SEQ ID No. 45. 

10 

1 8. The isolated nucleic acid molecule according to claim 1 wherein 
the plant pathogen is selected from the group consisting of Pyrenophora teres, 
Pyrenophora avenae, P. bromi, P. leuceienes, P. phaeocomes, P. schroeteri, P. 
trichostoma, or P. tritici-repentis. 

15 

1 9. The isolated nucleic acid molecule according to claim 1 8 wherein 
the plant pathogen is Pyrenophera teres. 

20. An isolated nucleic acid molecule coding for a polypeptide having 
20 the amino acid sequence as set forth in SEQ ID No. 46. 

21. An isolated nucleic acid molecule comprising the nucleotide 
sequence as set forth in SEQ ID No. 45. 

25 22. An isolated polypeptide encoded by the nucleic acid of claim 1 7. 

23. The polypeptide according to claim 22, wherein the polypeptide 
has an amino acid sequence which has at least 75 % similarity to the amino acid 
sequence of SEQ. ID. No. 46 as determined by a BLAST program with the default 

30 parameters. 

24. The polypeptide of claim 23 wherein the polypeptide comprises the 
amino acid sequence as set forth in SEQ ID No. 46. 
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25. A vector comprising the nucleic acid molecule of any one of claims 
1-5, 9-13, or 17-21. 

26. A vector according to claim 25, wherein the nucleic acid molecule 
5 is operably linked to a promoter. 

27. A vector according to claim 26, wherein the nucleic acid molecule 
is in a sense orientation. 

10 28. A vector according to claim 26, wherein the nucleic acid molecule 

is in an antisense orientation. 

29. The vector according to claim 26, wherein the vector is capable of 
maintaining and expressing the nucleic acid molecule in bacterial cells. 

15 

30. The vector according to claim 26, wherein the vector is capable of 
maintaining and expressing the nucleic acid molecule in plant cells. 

31. A host cell transformed with the vector according to claim 25. 

20 

32. A host cell according to claim 3 1 , wherein the host is a plant. 

33. A host cell according to claim 3 1 , wherein the host is selected from 
the group consisting of com, oats, grasses, weeds, bamboo, and sugarcane. 

25 

34. A host cell according to claim 3 1 , wherein the host is corn. 

35. A plant transformed with the nucleic acid molecule according of 
any one of claims 1-5, 9-13, or 17-21. 

30 

36. An isolated DNA molecule according to claim 1 , wherein the 
nucleic acid molecule has a nucleotide sequence which is 70% identical to the 
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nucleotide sequence of SEQ. ID. No. 41 as determined by a BLAST program 
with default parameters. 

37. An isolated DNA molecule according to claim 1 , wherein the 

5 nucleic acid molecule is 80% identical to the nucleotide sequence of SEQ. ID. 
No. 41 as determined by a BLAST program with default parameters. 

38. An isolated DNA molecule according to claim 1 , wherein the 
nucleic acid molecule is 90% identical to the nucleotide sequence of SEQ. ID. 

10 No. 4 1 as determined by a BLAST program with default parameters. 

'39. An isolated DNA molecule according to claim 1 , wherein the 
nucleic acid molecule has a nucleotide sequence which is 70% identical to the 
nucleotide sequence of SEQ. ID. No. 43 as determined by a BLAST program 
1 5 with default parameters. 

40. An isolated DNA molecule according to claim 1 , wherein the 
nucleic acid molecule is 80% identical to the nucleotide sequence of SEQ. ID. 
No. 43 as determined by a BLAST program with default parameters. 

20 

41. An isolated DNA molecule according to claim 2 1 , wherein the 
nucleic acid molecule is 90% identical to the nucleotide sequence of SEQ. ID. 
No. 43 as determined by a BLAST program with default parameters. 

25 42. An isolated DNA molecule according to claim 1 , wherein the 

nucleic acid molecule has a nucleotide sequence which is 70% identical to the 
nucleotide sequence of SEQ. ID. No. 45 as determined by a BLAST program 
with default parameters. 

30 43. An isolated DNA molecule according to claim 1 , wherein the 

nucleic acid molecule is 80% identical to the nucleotide sequence of SEQ. ID. 
No. 45 as determined by a BLAST program with default parameters. 
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44. An isolated DNA molecule according to claim 2 1 , wherein the 
nucleic acid molecule is 90% identical to the nucleotide sequence, of SEQ. ID. 
No. 45 as determined by a BLAST program with default parameters. 

5 45. A method for identifying inhibitors of a CPS 1 protein , wherein 

said CPS1 protein is a peptide synthetase of a plant pathogen, said method 
comprising: 

providing a CPS1 protein or polypeptide; 

contacting the protein or polypeptide with potential inhibitor 

10 compounds; 

determining peptide synthetase activity; and 

selecting compounds which decrease the peptide synthetase 

activity. 

15 46. The method of Claim 45 wherein the CPS 1 protein is from 

Alter nana solani. 

47. The method of Claim 45 wherein the CPS 1 protein is from 
Fusarium graminearium. 

20 

48. The method of Claim 45 wherein the CPS 1 protein is from 
Pyrenophora teres. 

49. A method of imparting disease resistance to a plant, said method 
25 comprising over-expressing a CPS1 polypeptide in the plant, wherein the 

polypeptide has protein synthetase activity. 

50. A method according to claim 49, wherein the plant is selected from 
a group consisting of corn, oats, grasses, weeds, sugarcane, barley, wheat, rice, 

30 tomato, potato, citrus, malus, rye, cotton, brassica, cabbage, and carrot. 

51. The method of claim 49 wherein the CPS 1 polypeptide is from 
Alternaria solani. 
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52. The method of Claim 49 wherein the CPS 1 polypeptide is from 
Fusarium graminearium. 

53. The" method of Claim 45 wherein the CPS 1 polyeptide is from 
5 Pyrenophora teres. 
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TGCCTGCGCCTGTGCTTGTGCCTGTGGAATGTCGCGGCCCGCTGCTGCAT - 776 

AGCCTATCTGTACATACAACACCATCCCATCCCGCTTCACCTGCCTTGCC -726 

TCCCTCCTCGTGCCACACATCCGCCGCCCACAACACCATGGCTGCGACCA -676 

ACCCCGAGCTGCAGGCCAAACTGCAGGAGCTGGACCACGAGCTCGAGGAG -626 

GGCGATATTACACAAAAAGGGTCCGTACTGCTGCACCACCACCGCCATCC -576 

GCCTCTCTGCGTGCGCTAATCAGTCGCATAGCTATGAAAAACGTCGCACC -526 

GTGCTGCTGTCGCAGTATCTAGGGCCTGACTTTGCTGCCCAGTTGCAGGC -476 

CGACCTGAACCAGCAGAACCCACCCCAACCATCCAGTGAGGGCTCTCGCT -H26 

CCCGCACCGCATCCTTTGCTATTCCGTCCGGTCCGAGTCCATCACGGCGA -376 

CCACAACCCCCACATATCCAGCTCCCCCGCCCCGACTCATACCATGACGC -326 

TTCCGCACAGGGCCAATTGGGCGCACCCATGCCATATGCGAACGCCTCCG -276 

CCGCTGCCTCGGGGGGCTCGCAGTACATGGCATACCCGCCCAGCCAAGTC -226 

GGCCGTTTTCAAGAGAAGCAGCTGGGCCTGCGTACAAATTCGCTCCAGCG - 1 76 

C AAT TCCTCACAGCTGTCGCAAGGAAGCGAGACGTTCATTCCACGGCCTC - 1 26 

AAACGCCTGAATACAACCACTCGCGCGAGCCCACCATGATGGGCAACTAC -76 

gccttVaatccagacmtcagcaaagttatgatggccaatttggctctcc -26 

GGGAGAGGCCAGTCGAAGGAGCACCAIfiCTCGAGGTAAACCAGGGTTATT 25 

MLEVNOGY 

TTTCCGACTTCACAGGCCAGCAGATGCAAGACAATCGCGACTCGTATGGG 75 
FSDFTGQQMQDNRDSYG 

GGACCCAACCGCTACTCGTCGGGAGATGCCTTTTCTCCTACCGCCGCGAT 1 25 
GPNRYSSGD.AFSPTAA I 

TCCACCTCCCATGATGAACCCCAACGATCTCCCCTTGGGCGCTGCTGAAA 1 75 
PPPMMNPNDLPLGAAE 

CCATGATGCCGCTAGAGCCCCGCGATCTGCCTTTTGACGTTTACGACCCT 225 
TMMPLEPRDLPFDVYDP 

FIG. 17 A 



SUBSTITUTE SHEET (RULE 26) 



WO 01/38489 



PCT/USOO/32227 



18/51 



CACAACCCCAATGTCAAAATGTCAAAGTTTGACAACATTGGCGCTGTCTT 275 
HNPNVKMSKFDNIGAVL 

GCGTCACCGAAGTCGCACACAGCCAAGGACGACTGCCTTCTGGGTCCTTG 325 
RHR SRTQPRTTAFWVL 

ACGCAAAAGGCAAAGAGACGGCGTCCATCACCTGGGAAAAGGTGGCTAGT 375 
DAKGKETASITWEKVAS 

CGCGCGGAAAAGGTGGCCAAAGTGATTCGGGACAAGAGCAACCTCTATCG 425 
RAEKV AKVIRDKSNLYR 

AGGCGACCGTGTGGCATTAGTGTACAGGGATACAGAAATCATTGATTTTG 475 
GORVALVYRDT EI IDF 

TCGTGGCGTTGATGGGC TGCTTCATTGCGGGCG TTGTAGCGGTACCCATC 52 5 

V V A L M G gM^^ 

Corel 

AATAGCGTCGACGACTACCAGAAACTCATTCTTCTCCTAACGACAACTCA 575 
DYQKLIL LLTTTQ 



AGCTCATCTCGCATTGACCACAGACAACAATCTCAAGGCCTTTCATCGTG 625 
AHLALTTDNNLKAFHR 

ACATTAGTCAGAACCGTCTGAAATGGCCGAGTGGGGTAGAGTGGTGGAAG 675 
D I SQNRLKWPSGVEW WK 

ACGAACGAGTTTGGCAGCCACCACCCCAAGAAACATGACGATACTCCAGC 725 
TNEFGSHHPKKHODTPA 

TTTGCAAGTACCAGAGGTTGCCTATATTGAGTTCTCGCGTGCACCTACTG 775 
LQVPEVAYIE 



Core 2 

GTGACCTTCGCGGTGTGGTGCTTAGTCACCGGACTATTATGCACCAAATG 825 

M H Q M 



GCCTGCATCAGTGCCATGATTAGCACGATACCCACCAACGCTCAGAGCCA 875 
ACISAM ISTIPTNAQSQ 

AGACACGTTCAGCACTAGCCTACGGGATGCAGAGGGAAAGTTCGTTGCTC 925 
DTFSTSLROAEG KFVA 

CAGCACCGTCCAGAAACCCCACAGAAGTGATCCTCACGTACCTCGACCCG 975 
PAPSRNPTEVILTYLDP 

CGCGAAAGCGCTGGTCTCATTCTCAGTGTCTTGTTTGCAGTTTATGGAGG 1025 
RESAGL ILSVLF AVYGG 



FIG. 17B 



SUBSTITUTE SHEET (RULE 26) 



WO 01/38489 



PCT/US00/32227 



19/51 



CCACACCACCGTATGGCTCGAGACAGCGACCATGGAAACCCCGGGTCTAT 1075 
HTTVWLETATMETPGL 

ATGCACATCTCATCACCAAATACAAGTCCAACATACTGCTAGCGGATTAC 1 125 
YAHL I TKYKSN I LLADY 

CCAGGCCTCAAGCGCGCTGCATACAACTACCAACAGGATCCAATGGCTAC 1 175 

PGLKRAAYNYQQDPMAT 

AAGAAACTTCAAGAAAAACACAGAACCCAACTTCGCCTCCGTGAAGATCT 1 225 
RNFKKNTEPNFASVK I 

GTCTGATTGAC ACGCTTACCGTCGACTGTGAATTTCACGAAATTCTCGGA 1 275 
CL I DTLTVDCEFHE I L G 

GATCGATATTTCAGGCCACTGCGAAACCCTAGAGCGCGAGAACTGATCGC 1 325 
DRYFRPLR NPRAREL I A 

GCCAATGCTCTGCTTGCCAGAACATGGTGGAATGATAATATCTGTACGCG 1375 
PMLCL.PEHGGMI I .SVR 

ACTGGCTAGGTGGAGAGGAGCGCATGGGCTGCCCGCTAAGC ATAGCAGTA 1 425 
DWLGGEERMGCPLS I AV 

GAAGAGTCAGATAATGATGAAGATGATACAGAGGATAAGTATGCAGCGGC 1 475 
EESDNDEDDTED KYAAA 

AAATGGCTACTCCAGTCTTATTGGTGGTGGCACTACAAAGAACAAAAAGG 1 525 
NGYSSL IGGGTTKNKK 

AGAAGAAGAAGAAAGGCCCGACAGAGCTTACAGAAATCTTGCTGGACAAG 1 575 
EKKKKG PTELTE I LLDK 

GAAGCTCTGAAGATGAACGAAGTCATTGTTCTGGCCATTGGAGAAGAAGC 1 625 
EALKMNEVIVLAIGEEA 

AAGCAAGCGGGCAAACGAGCCCGGCACCATGCGAGTCGGTGCCTTTGGAT 1 675 
SKRANEPGTMRVGAFG 

ACCCCATACCGGATGCGACACTAGCTATTGTAGACCCTGAGACAAGTCTT 1 725 
Y PIPOATLAIVOPETSL 

CTATGTTCACCATACTCGATAGGCGAGATCTGGGT^ATTCGCCTTCACT 1 775 

Core 3 

CTCTGGTGGCTTCTGGCAGCTGCAGAAGCATACAGAGACCATTTTCCATG 1 825 
HHHHBHB WQLQKHTET I FH 
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CTCGACCATACCGTTTCGTTGAGGGTAGCCCTACGCCACAGTTGCTTGAA 1875 
ARPYRFVGGSPTPQLLE 

CTCGAGTTTCTGCGTACTGGACTCCTCGGCTTTGTTGTAGAGGGAAAAAT 1925 

L e f iwmmmmmmmmmv v e g k i 

Core 4 



ATTTeTCCTTCBACTGTACGAAGATCGCATCAGACAGCGTGTTGAAJGGG 1975 

F V 

CoreS 

TAGAAAATGGTCAGCTTGAAGCCGAGCATCGATACTTTTTTGTGCAGCAC 2025 

AEHRYFFVGH 



CTGGTCACAAGCATTATGAAGGCCGTGCCAAAAATTTACGACTGgtaagt 2075 
LVTSIMKAVPKIYDC 

gagctgccaacagagcaaggactgtctaacgtg-tcatagCTCGTCGTTTG 2125 

S S F 

ATTCTTATGTAAATGGTGAATACCTGCCAATCATTCTCATCGAGACGCAG 2175 
DSYVNGEYLPI ILIETQ 

GCCGCATCGACTGCGCCCACAAACCCAGGTGGACCACCACAACAATTGGA 2225 
AASTAPTNPGGPPQQLD 

TATACCATTTTTGGATTCACTATCTGAGAGGTGCATGGAGGTCCTTTACC 2275 
I P F ERCMEVLY 
Core 6 

AAGAGCATCATTTACGGGTATACTGCGTGATGATTACAGCACCTAATACA 2325 
QEHHLRVYCVMITAPNT 

CTTCCACGAGTCATCAAGAACGGACGGCGAGAAATTGGCAATATGCTGTG 2375 
LPRV I KNGRRE IGNMLC 

TAGGAGAGAGTTTGACAATGGCTCTCTGCCCTGTGTNCACGTNAAGTTTG 2425 
RREFDNGSLPCVHVKF 

GCATTGAGCGATCAGTGCAGAACATTGCGCTCGGTGACGATCCCGCTGGC 2475 
GIERSVQNIALGDDPAG 

GGCATGTGGTCATTtGAGGCATCAATGGCACGTCAGCAATTCTTGATGCT 2525 
GM WSFEASMARQQFLML 

CCAAGACAAGCAATACTCTGGTGTCGATCATCGCGAAGTCGTCATTGACG 2575 
QDKQYSGVD HREVV I 0 

. ACAGGACATCGACTCCACTCAATCAGTTCTCGAATATCCACGACCTGATG 2625 
DRTS TPLNQFSNIHDLM 
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CAATGGC6TGTATCTCGGCAG6CCGAGGAACTTGCTTACTGCACTGTCGA 2675 
QWRVSRQAEELAYCTVD 

CGGTCGAGGAAAAGAGGGCAAAGGCGTCAATTGGAAGAAGTTTGATCAAA 2725 
GRGKEGKGVNWKKFDd 

AGGTTGCGGGCGTAGCAATGTACCTCAAGAACAAGGTCAAGGTCCAGGCC 2775 
KVAGVAMYLKNKVKVQA 

GGCGATCATCTCCTTCTGATGTACACGCATTCAGAAGAATTTGTTTATGC 2825 
GDHLLLMYT. HSEEFVYA 

TGTTCATGCATGTTTTGTGCTTGGAGCTGTTTGCATACCAATGGCGCCAA 2875 
Coral 

TTGATCAGAACCGGTTGAATGAGGATGCGCCGGCCTTGCTGCATATCCTT 2925 
Hi QNRLNEDAPALLH I L 
GCAGATTTCAAGGTCAAAGCCATTCTTGTCAACGCTGACGTTGACCATCT 2975 
AOFKV KAI LVNADVDHL 

GATGAAGATCAAGCAAGTATCGCAGCACATCAAACAATCGGCCGCTATCC 3025 
MKIKQVSQHIKQSAAI 

TCAAGATCAGTGTGCCAAACACATACAGCACAACAAAGCCGCCAAAGCAA 3075 
LK I SVPNTYSTTKPPKQ 

TCCAGTGGCTGCCGCGACCTCAAGCTTACAATTCGACCGGCATGGATTCA 3 125 
SSGCRD LKL T I RPAW I Q 

GGCGGGTTTCCCAGTGCTAGTC ^G^CATA C^G^^G^C^A^^AA^G^ 3175 

Core 2 

GTATCGCAGTTCAGCTGGGCCAT AGCCAAATCA TGGCACTGTGCAAGGTC 3225 

^^m^^msm^^^^immmtimtm n a l c k v 

CAAAAAGAAACATGCCAAATGACAAGTACACGACCAGTCCTTGGTTGTGT 3275 
QKETCQMTSTRPVLGCV 

CCGGAGCACGATAGGACTTGGTTTCCTTCACACTTGTCTCATGGGAATCT 3325 
RST IGLGFLHTCLMG I 

TCCTTGCCGCACCCACATACCTGGTGTCACCTGTTGACTTTGCACAAAAC 3375 
FLAAPTYLVSPVDFAQN 

CCTAATATTCTGTTCCAAACGCTTTCGCGGTACAAGATCAAGGATGCATA 3425 
PNILFQTLSRYKIKDAY 
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TGCAAC'GAGTCAAATGTTGGACCACGCCATCGCACGCGGAGCTGGTAAGA 3475 
ATSQMLDHAIARGAGK 

GTATGGCTCTGCACGAGCTGAAGAATCTCATGATTGCGACTGATGGAAGA 3525 
SMALHELKNLMIATDGR 

CCACGCGTTGATGTTTgtaagtgaacatttgtatgagaggactttccrtga 3575 
P R V D V 

t tgctaac tcaotgcagACCAAAGAGTGCGTGTGCACTTTGCGCCAGCCA 3625 
YQRVRVHFAPA 

ACTTAGACCCAACCGCAATCAACACTGTCTACTCACATGTATTGAACCCA 3675 
NLD PTA I NTVYSHVLNP 

atggtagcatcacgatcatacatgtgtattgagccagtcgagctccatct 3725 
hyas'rsymciepvelhl 

cgatgtgcatgctctgcgacgcggcctcgtcatgcccgttgaccctgaca 3775 
dvhalrrglvmpvdpd 

cagagcccaacgctttgctcgtccaagactcgggcatggtgccagtgagc 3825 
tepnallvqdsgmvpv s 

acgcaaatatccattgtcaacccagagaccaaccaactgtgcttgaacgg 3875 
tqi8ivnpetnqlclng 

C^T^ff^^ 1 ^ 3925 
Core 3 

ACATGTCGAAAGAGCGCTTGGATGCAGAACGCTTCAATGGGAGGACGATT 3975 
YMSKERLDAER FNGRT I 



GACGGAGACCCAAATGTGCGATATGTTCGTACAGGCGATTTAGGATTTTT 4025 

o g d p n v r mssmsmmmmmmmmBML 

Core 4 

GCACAGCGTGACACGGCCCATTGGACCCAACGGTGCACCTGTTGATATGC 4075 
HSVTRPIGPNGAP VDM 

AGGTGCTTTTCGTGCTTGGAAGCATAGGTGACACTTTTGAAGTCAACGGA 4 1 25 
QVLFVLGSIGDTFEVNG 

CTGAACCATTTCTCTATGGACATTGAGCAGTCTGTTGAACGTTGTCACCG 4 1 75 
LNHFSMDIEQSYERCHR 
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GAATATTGTCCCTGGAGGCTGgtacgtttcttcgattcgctgttatttag 4225 
N I V P G G C 

taaatacttactaacactctacagTGCTGTTTTCCAGGCAGGTGGGCTTG 4275 
AVFQAGGL 

TTGTTGTCGTTGTGGAAATCTTCCGACGCAACTTCCTCGCAAGCATGGTG 4325 
VVVVVE IFRRNFLASMV 

CCTGTGATTGTCAATGCAATTTTGAACGAGCATCAGCTGGTCATTGACAT 4375 
PV I VNAILNEHQLV I D I 

TGTCTCGTTTGTGCAAAAGGGCGACTTCCACCGGTCTCGTCTGGGCGAGA 4425 
VSFVQKGDFHRSRLGE 

AGCAACGCGGAAAGATTCTTGCAGGATGGGTCACACGGAAGATGCGCACA 4475 
KQRGK I LAGWVTRKMRT 

ATAGCCCAGTACAGTATACGGGATCCTAATGGACAG£A/QDCCAGATGAT 4525 

IAQYSIRDPN ^KBBH^ M I 

Core 6 

CACGGAAGAGCCTGGTCCACGGGCTAGCATGACTGGAAGTATGCTTGGGC 4575 
TEEPGPRASHTGSMLG 

GAATGGGCGGCCCAGCCAGTATCAAGGCCGGGTCGACAAGAGCACCGAGT 4625 
RMGGPASIKAGSTRAPS 

CTAATGGGCATGACAGCGACTATGAATAATCTATCCCTTACACAGCAGCA 4675 
LMGMTATMNNLSLTQQQ 

ACAGCAGCAATACCAACAGCCGGGTATGTATGCTCAACAGCAAGGCATGC 4725 
QQQYQQPGHYAQQOGM 

ACCCCCAGCAACAACACCAATTTAGCATGTCCAACACGCCACCACAAGGT 4775 
HPQQQHQFSMSNTPPQG 

CCACCCCAAGGCGTAGAACTACATGATCCTAGCGACCGCACACCAACAGA 4825 
P PQGVELHOPSDRTPTD 

CAACCGGCACTCTTTCCTTGCCGACCCGCGTATGCAGAACCA ( GGGCCAAA 4875 
NRHSFLADPRMQNQGQ 

TGAACGAGACGGGCGCCTACGAACCCATGAACTATCAAAACGCGTATCAT 4925 
MNETGAYEPMNYQNAYH 

CCGCATCAACAACAATACGAATCTGAAGACGGGGGGAGCAGACTCAGCGG 4975 
PHQQQYESEDGGSRLSG 
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CCCCGTGCCAGACGTGCTGCGGCCGGGTCCTTCATCCGGGTCCATAGAGC 5025 
P V P D V L R P I G P S S G 1 S I E 

AGCACGACCAAGCTAACAACGACAACAATATGTGGAATAATCGCGAGTAC 5075 
QHDQAN NDNNMWNNREY 

TATGGTAACAGCCCATCGTATGCAGGCGGATACACGC AAGATGGCAAT AT 5 1 25 
YGNSPSYAGGYTQDGN I 

CCACGAGCAGCAACAACACGATGAGTACACGAGTAATGCGTCATATGGCG 5175 
HEQQQ HDEYTS NASYG 

GAAATCAAGGAGCAGGCGGAGGCAGCGGCGGCGGTGGCGGTCTCCGAGTT 5225 
GNQGAGGGSGGGGGLRV 

GCAAATCGTGACAGCTCCGACAGCGAGGGTGCAGATGACGACGCTTGGAG 5275 
ANRDSSDSEGADDDAWR 

ACGTGATGCCCTTGCTCAGATCAATTTTGCGGGCGGCGCTGGTGCTGCCT 5325 
RDALAQ INFAGGA AAA 

CCGCTGGAGCACCTGCTGCTGGTGCTTCTTCTTCGCAGCCGGGCCATGCG 5375 
SAGAPAAGASSSQPGHA 



CA GTAGA CGGGATATGCGTGAGTTTTTTTTTAAATTTCGTACATAGAGAC 
Q END 



5425 



CGTTGTATACGCAGGTTTCAAATTAGAAGAGCGAATATGCATATCAGCTG 5175 

TTGTTCAATGTTCTAGTTTGGGAAGGTTAACCCCCCCCCCTTCCCCTTCC 5525 

AAGACTTTTCACTTGTTTGTGTGTGATTTAAATCTGGAGATTTCAAATCT 5575 

ACATCTCGCTATACATAGGTGTTGTTTGATAACGTAGGGGGCAGAAGGGT 5625 

ATCTCGTGATATTAGACTGGGAGTTGCATGAATCAAGGTGTTGAGCAAAA 5675 

AAAGAGAGAGCGGTGAAGGGCGGGGGGGATAGGTGGTGTGCACGTGGCTG 5725 
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AAGCCAGGTACGCATCTGCGCAGGGCCCGGCAAGCCCAGTATGACGTTTT 
TACCCCAGAGCCAGCCTGCTCCTCGGCTTGGCGCCGCCTGGCTTAGTCAG 
CCCCCATCAGTCAGCGGCCAGTCACGTGTTCGGCGCGACAAGCTCCCACA 
TGGCCGATGATCATTGTGCCTCGGCTCATGCGCGTCTGCGCTGCTTTGAC 
GTGCCCCGGAAATGACGACGCCCAAAGTCTCACACAGCCGCTCGCTGCTG 

ctgcttgcatctggtcttgctacactggcctcctVttcgcagaacacaga 

ACGATCCACCAACAGG^CWTCAGCCACCACCMfiGCCACCCTCATC 

n A I L. l 

CGCCCCCCGCCCGCCGACCACTCCCAGTCGGCCATTGAGAACGTCCTTGA 
RPPPADHSQSAIENVLt 

ACTGACGCAGCTGGCCGACATTGATCCCAACATCTTCACAAACACTCGTC 
LTQLADIDPNIFTNTK 

CCCTATGGCATCCCCCCGGTGCGCGCGGCATCTTTGGCGGTGCTGCCATT 
PLWHPPGARGI FGGAA l 
GCCCAGACACTCAGCGCCGCCCAGAAGACGGTTGACCCCGACTTCACTGT 2 1 5 

aqtlsaaqktvdpdftM 

GCATTCGATGCACTGCTACTTCATCCTTGCAGGCAACTCCGAGATCCCCG 
HHCYFILAGNSEIP 

TCATATACCATGTAGAAAGGGTGCGCTCGGGAAAGTCGTTCGCAACCAGG 
v jYHVERVRSGKSFATR 



-286 
-236 
-186 
-136 
-86 
-36 
15 

65 

115 

165 



ACGGTGCAGGCCCGGCAAAGGGGCAACGTCATCTTCACTACGACCATGAG 
TVQARQRGNVIFTT1 n i> 

TTTTGTTCGCCAGAACAGTGGAGGCGCTCAAAAGGTCGAGCACATCTACC 
FVRQNSGGAQKVtH l t 

CGATGCCAGACGTCCCGGCGCCCAAGGAAGGTAGCGACGACTTGAAGACG 
PMPDVPAPKEGSDDL^ i 

CCCAACGATGGCCAGAGTCCCTTCCAGACCCAGCTGTTGCCCATTGAAAA 
PNDGQSPFQTQLLP 

CGgtgagtgacttcgcataccatccttaagcctgccctagctcatatcta 565 
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gCAGACGACTCCGACAAGCCCCACACCAAGAAATGCCGACAATGGATAAA 6 1 5 
ADDSDKPHTKKCRQW I K 

GGCTCGCGGTAAAATCTCCCCCGCTGGCGGTCACGAAGCCCATCTCTCCG 665 
ARGK I SPAGGHEAHLS 

CCATCGCATACATGTCCGATAGCTACTTCATCGGCACCGTCGCGCGTGCG 7 1 5 
AlAYMSDSYFIGTVARA 

CACAAGCTTCTGCGCTACTCGAACCAGCGCAAGAGCAGGGCCAGGTCGAG 765 
HKLLRYSNQRKSRARSS 

C ATCGACGAGGACGT ACTTAAGAAGCTGCTCGAGATGG ATG ATGCCGAGT 8 1 5 
I DEDVLKKLL EMDDAE 

TACAGCGCCAAAGCTTTGTCAACGAATCAGACAAGCAGCGCATACGTGAA 865 
LQRQSFVNESDKQR I RE 

TTGAGGAAAGCAGAAGACCTGGCAAAGAGCGGAGACGC AAAGCCTGAGAT 9 1 5 
LRKAEDLAKSGDAKPE I 

TGGCATGATGGTTAGTCTGGACCACACCATCTACTTTCACAATCCTCGCA 965 
GMMVSLDHTIYFHNPR 

GTTTCCGCGCAGATGAATGGATCTTCACCGAGATGGAGACCCCTTGGGCT 1015 
SFRADEWIFTEHETPWA 

GGTGATGGTCGCGGCCTCGTTTCCCAGAGAATGTATACCAAGGATGGCAC 1065 
G DGRGLVSQRMY TKDGT 

GCTCATTGCCAGCTGCGTCCAAGAGGTAAGCAGCTTGCTTATGTCTGCTA 1115 
LIASCVQEVSSLLMSA 

TAGTAGACTGTGCTTACATGAATAGGGCGTCATACGATIEAAGCAGAACG 1 165 
I VDCAYMNRASYD END 

AAAGT AAGCTATAGCCATGACCGTGTGTAGATTGCTCGTATGTCCGTGTA 1215 

CTTTGATTCGATTGACTTTTATGCCGCCGCCTTTCACTTGACCTTGTCGG 1 265 

AGTAGACTCCGCTGTTTCTATTGTTTTAGTAGCTTTAGATTTGGAGGTCT 1315 



GGAACACACGTACCTCACGTATACACTTTTAAATACAAGGTTCCTCGTTA 1 365 

CAAATAGCAACTCGCACTTGTGTAGTACACGACTTGGCACAGCAAGTGCA 1415 

TGCAGATGATGCGCACTGCATGCAGCCCTAGAAGCCTGCACCTGTGCTGT 1 465 

TCGTCCTTCCCACGTTACAGGCCAGGGTCCAATTAGCGCAATGCCGGTCT 1516 

CGGGCTAGCATGGCCCTCACACTTGGATTCTCCGAAGCTACCGTCGGCAG 1 565 
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